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TITLE OF THE INVENTION 

ASSAYS FOR NUCLEAR RECEPTOR LIGANDS USING FRET 
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CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional 
Application No. 60/061,385, filed 10/7/97, the contents of which are 
incorporated herein by reference in their entirety. 

10 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 

Not applicable. 

REFERENCE TO MICROFICHE APPENDIX 
15 Not appUcable. 

FIELD OF THE INVENTION 

This invention relates to methods of identifying novel . 
agonists and antagonists of nuclear receptors utilizing the agonist- 
20 dependent interaction of such receptors with CREB-binding protein 

(CBP) or other nudear receptor co-activators in which this interaction is 
detected by fluorescence resonance energy transfer 

BACKGROUND OF THE INVENTION 

25 Nuclear receptors are a superfamily of ligand-activated 

transcription factors that bind as homodimers or heterodimers to their 
cognate DNA elements in gene promoters. The superfamily, with more 
than 150 members, can be divided into subfamilies {e.g. the steroid, 
retinoid, Hiyroid hormone, and peroxisome proUferator-activated 

30 [PPAR] subfamilies). Each subfamily may consist of several members 
which are encoded by individual genes (e.g. PPARa, PPARy, and 
PPARS). In addition, alternative mRNA splicing can result in more 
than one isoform of these genes as in the case of spedfic PPARs ie.g. 
PPAR7I and PPAR72). The nudear receptor superfamily is involved in 

35 a wide variety of phjrsiological functions in mammalian cells: e.g.f 

differentiation, proliferation, and metabolic homeostasis. Dysfunction 
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or altered expression of specific nuclear receptors has been found to be 
involved in disease pathogenesis. 

The PPAK subfamily of nuclear receptors consists of three 
members: PPARa, PPARy, and PPAR5. PPARa is highly expressed in 
5 liver and kidney. Activation of PPARa by peroxisome proliferators 
(including hypolipidimic reagents such as fibrates) or medium and 
long-chain fatty acids is responsible for the induction of acyl-CoA 
oxidase and hydratase-dehydrogenase (enzymes required for 
peroxisomal P-oxidation), as well as cytochrome P450 4A6 (an enzyme 

10 required for fatty acid 0)-hydroxylase). Thus, PPARa has an important 
role in the regulation of lipid metabolism and is part of the mechanism 
through which hypolipidimic compoimds such as fibrates exert their 
effects. PPARy is predominantly expressed in adipose tissue. Recently, 
a prostaglandin J2 metabolite, 15-Deoxy-D12,14-prostaglandin J2, has 

15 been identified as a potential physiological ligand of PPARy. Both 15- 
Deoxy-D12,14-prostaglandin J2 treatment of preadipocytes or retroviral 
expression of PPAR72 in fibroblasts induced adipocjrte differentiation, 
demonstrating the role of PPARy in adipocyte differentiation and lipid 
storage. The demonstration that anti-diabetic and lipid-lowering 

20 insulin sensitizing compounds known as thiazolidinediones are high 
affinity Hgands for PPARy suggests a broad therapeutic role for PPARy 
ligands in the treatment of diabetes and disorders associated with 
insulin resistance {e.g, obesity and cardiovascular disease). 

Nuclear receptor proteins contain a central DNA binding 

25 domain (DBD) and a COOH-terminal Ugand binding domain (LED). The 
DBD is composed of two highly conserved zinc fingers that target the 
receptor to specific promoter/enhancer DNA sequences known as 
hormone response elements (HREs). The LBD is about 200-300 amino 
adds in length and is less well conserved than the DBD. There are at 

30 least three functions for the LBD: dimerization, ligand binding, and 
transactivation. The transactivation function can be viewed as a 
molecular switch between a transcriptionally inactive and a 
transcriptionally active state of the receptor. Binding of a ligand which 
is an agonist flips the switdi firom the inactive state to the active state. 

35 The COOH-terminal portion of the LBD contains an activation function 
domain (AF2) that is required for the switch. 

-2- 
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The ligand-induced nuclear receptor molecular switch is 
mediated through interactions with members of a family of nuclear 
receptor co-activators ie.g., CBP/p300, SRC-l/NcoA-l, TIP2/GRIP- 
l/NcoA-2, and p/CIP). Upon binding of agonist to its cognate receptor 

5 LBD, a conformational change in the receptor protein creates a co- 
activator binding surface and results in recruitment of co-activator(s) to 
the receptor and subsequent transcriptional activation. The binding of 
antagonist ligands to nuclear receptors will not induce the required 
conformational change and prevents recruitment of co-activator and 

10 subsequent induction of transcription. The co-activators CREB-binding 
protein (CBP) and p300 are two closely related proteins that were 
originally discovered by virtue of their ability to interact with the 
transcription factor CREB. These two proteins share extensive amino 
acid sequence homology* CBP can form a bridge between nuclear 

IS receptors and the basic transcriptional machinery (Kamei et al., 1996, 
Cell 85:403-414; Chakravarti et al., 1996, Nature 383:99-103; Hanstein et 
al., 1996, Proc. Natl. Acad. Sd. USA 93:11540-11545; Heery et al., 1997, 
Nature 387:733-736). CBP also contains intrinsic histone 
acetyltransferase activity which could restdt in local chromatin 

20 rearrangement and further activation of transcription. Ligand- and 
AF2-dependent interaction between certain nuclear receptors and CBP 
has been demonstrated in in vitro pull down assays and far-western 
assays. This interaction is both necessary and sufficient for the 
transcriptional activation that is mediated by these nuclear receptors. 

25 Thus, an AF2 mutant of the estrogen receptor (£R) which abolishes the 
transchptonal function of the receptor is incapable of interacting with 
CBP. 

The N-termini of CBP and p300 have been shovm to interact 
with the ligand-binding domains of some nuclear receptors (Kamei et 
30 al., 1996, Cell 85:403-414, hereinafter 'ICamei''). Kamei was able to 

demonstrate direct interaction of CBP and p300 with nuclear receptors 
by several different methods: 

(1) Kamei produced GST fusion proteins of the first 100 
amino adds of the N-terminus of CBP. These fusion proteins were run 
35 out on a polyacrylamide gel, transferred to a membrane, and tiie 
membrane was exposed to 32p.labeled ligand-binding domains of 

-3- 
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nudear receptors. In the presence of ligand, a specific binding 
interaction between the CBP and nuclear receptor fragments was 
detected in that the 32p-labeled ligand-binding domains were observed to 
bind to the bands on the membrane containing the GST-CBP fusion 
5 proteins. 

(2) Kamei also utilized the yeast two-hybrid system. The 
ligand-binding domain of the nuclear receptor fused to the DNA-binding 
domain of the LexA protein was used as bait. The amino terminal 
domain of CBP fused to the gal4 transactivation domain was used as 

10 prey. In the presence of Uganda a specific binding interaction (occurring 
in vivo, i.e., within the yeast) was observed between the CBP and nuclear 
receptor fragments. 

(3) Kamei observed ligand-induced binding between CBP 
and nuclear receptors via a gel-shift assay. This assay is based on the 

15 observation that, in the presence of ligand, nuclear receptors will bind to 
oligonucleotides containing their target recognition sequence. Such 
binding results in the formation of a nudear receptor-ligand- 
oligonucleotide complex having a higher molecular weight than the 
oligonucleotide alone. This difference in molecular weight is detected 

20 via a shift in position of the 32p-labeled oligonucleotide when it is run out 
on a polyacrylamide gel. Kamei foimd that a fragment of CBP (the N- 
terminal 100 amino adds) was capable of binding to the nuclear 
receptor-ligand-oligonudeotide complex and shifting the complex's 
position on the gel to an even higher molecular weight. 

25 (4) Kamei was able to co-immunopredpitate CBP using 

antibodies to nuclear receptors in extracts from a variety of cells in the 
presence of ligand. 

(5) By the use of transcriptional activation assays^ Kamei 
was able to demonstrate that nudear receptors and CBP interact in a 

30 functional manner. Such transcriptional activation assays can indicate 
that two proteins are involved in a pathway that results in 
transcriptional activation but ihese assays do not prove that the 
interaction between the proteins is one of direct binding. 

By the above«described methods, Kamei was able to 

35 demonstrate spedfic binding interactions between CBP and the retinoic 
add receptor (RAR), glucocorticoid receptor (GR), thyroid hormone 

-4- 
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receptor (T3R), and retinoid X receptor (RXR)» Kamei also demonstrated 

specific binding between the N-terminus of p300 and RAR. However, 
Kamei did not demonstrate specific binding between CBP, p300, or any 
other nuclear receptor co-activators and PPARs. 

5 What is striking about the methods used by Kamei is their 

extremely laborious and time consimiing nature. Such methods 
involve, among other things, the construction of fiision proteins, the 
preparation of 32p.labeled proteins, the construction of specialized 
expression vectors for the yeast two-hybrid assay and the transcriptional 

10 activation assays, the running of many gels, and the raising of 

antibodies. Most of these assays take days to carry out and preparing the 
reagents needed to carry them out may take weeks. Because of the 
complicated reagents that are involved in these assays and the time 
needed to prepare and run the assays, these assays tend to be costiy. 

IS Investigators other than Kamei who have studied the interaction 

between nuclear receptors and CBP have also been forced to rely on such 
cumbersome methods (see, e.g., Chakravarti et al., 1996, Nature 383:99- 
103; Hanstein et al., 1996, Proc. Nail. Acad. Sd. USA 93:11540-11545; 
Heery et al., 1997, Nature 387:733-736). 

20 Kamei did not use the above-described methods to identify 

novel agonists or antagonists of nudear receptors. The focus of Kamei 
was not on agonists or antagonists, but rather on tiie interaction 
between nuclear receptors and CBP. Although modifying the methods 
of Kamei to identify agonists or antagonists might be possible, such 

25 methods would suffer from serious disadvantages. This is because, as 
discussed above, all of the assays employed by Kamei to study the 
interaction of CBP and p300 with nuclear receptors are very laborious, 
slow, and costly. Given the therapeutic importance of steroid hormones 
such as estrogen, Cortisol, progesterone, and other nuclear receptor 

30 agonists such as tiiyroid hormone and antidiabetic thiazolidinedione 
compounds, the need for improved high-throughput screening assays to 
identify potential pharmaceutical compounds affecting nuclear 
receptors is clear. Historically, therapeutically useful nuclear receptor 
ligand compoimds were identified by screening animal models, an 

35 approach which is even more labor intensive and time consuming than 

the methods used by Kamei. Also, approaches such as those used by 

-5- 
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Kamei are ill-suited for the identification of antagonists of nuclear 
receptors. It is now widely appreciated that antagonists of nuclear 
receptors can be valuable therapeutic agents. Examples of such 
therapeutically useful antagonists are tamoxifene, raloxifene, and RU- 
486. 

What is needed is a high throughput, time and labor- 
saving, non-radioactive, inexpensive, and very reliable assay for the 
identification and characterization of both agonists and antagonists of 
nuclear receptors. Such an assay is provided by the present invention. 



SUMMARY OF THE INVENTION 

The present invention provides novel methods of identifying 
agonists and antagonists of nuclear receptors. The methods take 
advantage of the agonist-dependent binding of nuclear receptors and 

15 GBP, p300, or other nuclear receptor co-activators. In the absence of 
agonist, binding between the nuclear receptor and GBP, p300, or other 
nuclear receptor co-activators does not occur. If agonist is present, 
however, such binding occurs and can be detected by fluorescence 
resonance energy transfer (FRET) between a fluorescently-labeled 

20 nudear receptor and fluorescently-labeled GBP, p300, or other nuclear 
receptor co-activator. Antagonists can be identified by virtue of their 
ability to prevent or disrupt the agonist-induced interaction of nuclear 
receptors and GBP, p300, or other nuclear receptor co-activators. In 
contrast to prior art methods of identifying agonists and antagonists of 

25 nuclear receptors, the methods of the present invention, are simple, 
rapid, and less costiy. 

The present invention provides a nuclear receptor or ligand 
binding domain thereof labeled with a fluorescent reagent for use in the 
above-described methods of identifying agonists and antagonists of 
30 nuclear receptors. The present invention also provides GBP, p300, or 
other nuclear receptor co-activator, or a binding portion thereof, labeled 
with a fluorescent reagent. 



-6- 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates a method of fluorescently labelling a 
protein or polypeptide with Europium cryptate (Eu3+K). 

Figure 2 illustrates the format for experiments 1 and 2 of 



Table 1. 



Table 1. 



10 Table 1. 



Figure 3 iUustrates the format for experiment 3 of 

Figure 4 illustrates the format for experiment 4 of 

Figure 5 shows the results of studies using the methods of 
the present invention with four known PPARy agonists, -o- = AD5075; 
= PiogUtazone; -X - = Troglitazone; -0- = BRL49653. 

Figure 6 shows a measurement of the binding constant for 
15 the interaction between hCBP and PPARylLBD. 

Figure 7A shows the amino add sequence of human GBP 
(SEQ.ID.N0.:1). 

Figure 7B shows the nucleotide sequence of a cDNA 
encoding human GBP (S£Q.ID.N0.:2). The open reading frame is at 
20 positions 76-1290. 

Figure 8A shows the amino add sequence of human 
PPARa (SEQ,ID.N0.:3). 

Figure 8B shows the nudeotide sequence of a cDNA 
encoding human PPARa (SEQ.ID.N0.:4). The open reading frame is at 
25 positions 217-1623. 

Figure 9A shows the amino add sequence of human 
PPARyl (SEQJD.N0.:5). 

Figure 9B shows the nudeotide sequence of a cDNA 
encoding human PPARyl (S£Q.ID.N0.:6). The open reading frame is at 
30 positions 173-1609. 

Figure lOA shows the amino add sequence of human 
PPAR5 (SEQ.ID.N0.:7), 

Figure lOB-G shows the nudeotide sequence of a cDNA 
encoding human PPARS (SEQ.ID.N0.:8). The open reading frame is at 
35 positions 338-1663. 
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DETAILED DESCRIPTION OF THE INVENTION 

For the purposes of this invention: 

- an ^agonist" is a substance that binds to nuclear receptors 
in such a way that a specific binding interaction between the nuclear 

5 receptor and CBP or other nuclear receptor co-activator can occur. 

- an ""antagonist" is a substance that is capable of preventing 
or disrupting the agonist-induced specific binding interaction between a 
nuclear receptor and CBP, p300, or another nuclear receptor co- 
activator. 

10 - a "ligand" of a nuclear receptor is an agonist or an 

antagonist of the nuclear receptor. 

- a ^specific binding interaction,^ ""specific binding,'^ and 
the like, refers to binding between a nuclear receptor and CBP, p300, or 
other nuclear receptor co-activator which results in the occurrence of 

IS fluorescence resonance energy transfer between a fluorescent reagent 
bound to the nudear receptor and a fluorescent reagent botrnd to CBP, 
p300, or other nuclear receptor co-activator. 

With respect to CBP, p300, or other nuclear receptor co- 
activators, a "T)inding portion" is that portion of CBP, p300, or other 

20 nuclear receptor co-activators that is sufficient for specific binding 
interactions with nuclear receptors. 

With respect to nudear receptors, a "ligand binding 
domain" is that portion of a nudear receptor that is suffident to bind an 
agonist or antagonist of the nuclear receptor. 

25 The present invention provides a high throughput, time 

and labor-saving, non-radioactive, inexpensive, and very reliable assay 
for the identification and characterization of both agonists and 
antagonists of nuclear receptors. In a general embodiment, the present 
invention provides methods of identifying agonists and antagonists for 

30 any nuclear receptor for which CBP, p300, or another nuclear receptor 
binding protein is a co-activator. Such agonists and antagonists are 
identified by virtue of their ability to induce or prevent binding between 
the ligand binding domain of a nuclear receptor and CBP, p300, or other 
nudear receptor co-activator. The interaction between the nuclear 

35 receptor and CBP, p300, or other nudear receptor co-activator is 

monitored by observing the occurrence of fluorescence resonance energy 

-8- 
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transfer (FRET) between two fluorescent reagents. One fluorescent 
reagent is bound to the nuclear receptor; the other fluorescent reagent is 
bound to GBP, p300, or other nuclear receptor co-activator. The binding 
of fluorescent reagent to nuclear receptor, GBP, p300» or other nuclear 
5 receptor co-activator can be by a covalent linkage or a non-covalent 
linkage. 

The present invention makes use of fluorescence resonance 
energy transfer (FRET). FRET is a process in which energy is 
transferred from an excited donor fluorescent reagent to an acceptor 

10 fluorescent reagent by means of intermolecular long-range dipole-dipole 
coupling. FRET typically occurs over distances of about 10^ to 1006 and 
requires that the emission spectrum of the donor reagent and the 
absorbance spectrum of the acceptor reagent overlap adequately and that 
the quantum yield of the donor and the absorption coefficient of the 

15 acceptor be sufficiently high. In addition, the transition dipoles of the 
donor and acceptor fluorescent reagents must be properly oriented 
relative to one another. For a review of FRET and its applications to 
biological systems, see Clegg, 1995, Current Opinions in Biotechnology 
6:103-110. 

20 The present invention makes use of a nuclear receptor or 

ligand binding domain thereof labeled with a first fluorescent reagent 
and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof, labeled with a second fluorescent reagent. The second 
fluorescent reagent comprises a fluorophore capable of undergoing 

25 energy transfer by either (a) donating excited state energy to the first 
fluorescent reagent, or (b) accepting excited state energy from the first 
fluorescent reagent. In other words, according to the present invention, 
either the first or the second fluorescent reagents can be the donor or the 
acceptor during FRET. 

30 The first and second fluorescent reagents are 

spectropscopically complementary to each other. This means that their 
spectral characteristics are such that excited state energy transfer can 
occur between them. FRET is highly sensitive to the distance between 
the first and second fluorescent reagents. For example, FRET varies 

35 inversely with the sixth power of the distance between the first and 
second fluorescent reagents. In the absence of agonist, the first 

-9- 
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fluorescent reagent, bound to the nudear receptor or ligand binding 
domain thereof, will not be near the second fluorescent reagent, bound to 
CBP, p300, or other nudear receptor co-activator, or binding portion 
thereof. Thus, no FRET, or very Uttle FRET, will be observed. In the 
5 presence of agonist, however, interaction between the nuclear receptor 
and CBP, p300, or other nudear receptor co-activator will occur, thus 
bringing dose together the first and the second fluorescent reagents, 
allowing FRET to occur and be observed. 

Accordingly, the present invention provides a method of 
10 identifying an agonist of a nuclear receptor that comprises providing: 

(a) a nudear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; and 

15 (c) a substance suspected of being an agonist of the 

nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nudear receptor or hgand 
binding domain thereof and CBP, pSOO, or other nuclear receptor co- 

20 activator, or a binding portion thereof, will occur; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nudear receptor. 

25 In particular embodiments, the nuclear receptor is selected 

firom the group consisting of steroid receptors, thyroid hormone 
receptors, retinoic add receptors, peroxisome proliferator-activated 
receptors, retinoid X receptors, glucocorticoid receptors, vitamin D 
receptors, and ""orphan nudear receptors" such as LXR, FXR, etc. 

30 In a particular embodiment, the nuclear receptor or ligand 

binding domain thereof is a fiill-length nudear receptor. In another 
embodiment, the nudear receptor or ligand binding domain thereof is a 









1 



the nudear receptor or ligand binding domain thereof comprises an AP- 
35 2 site of a nudear receptor. 

-10- 
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In a partictdar embodiment^ the nuclear receptor or ligand 
binding domain thereof is a full-length PPAR. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is the ligand 
binding domain of a PPAR. In a further embodiment, the PPAR is 
5 selected from the group consisting of PPARa, PPARyl, PPAR72, and 
PPAR5. In a further embodiment, the ligand binding domain of the 
PPAR contains amino acid residues 176-478 of himian PPARyl. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof contains amino adds 143-462 of human RARa. 
10 In another embodiment, the nuclear receptor or ligand binding domain 
thereof contains amino adds 122-410 of rat TaRal. In another 

embodiment, the nuclear receptor or ligand binding domain thereof 
contains amino adds 227-463 of mouse RXR7, In another embodiment, 
the nuclear receptor or ligand binding domain thereof contains amino 

IS adds 251-595 of human £R. 

In a particluar embodiment, the above-described methods 
utilize full-length CBP, either mouse or human. In other embodiments, 
the methods utilize amino add residues 1-113 of human CBP. In 
another embodiment, the methods utilize amino add residues 1-453 of 

20 himian CBP. 

The conditions imder which the methods described above 
are carried out are conditions that are typically used in the art for the 
study of protein-protein interactions: e.g., physiological pH; salt 
conditions such as those represented by such commonly used bufifers as 
25 PBS; a temperature of about 4''C to about 55*'C. The presence of 

commonly used non-ionic detergents, e.g., NP.40®, sarcosyl, Triton X- 
100®, is optional. When europium cryptates are used as fluorescent 

reagents, reactions should contain KF at a concentration of at least 200 
mM. 

30 Heery at al., 1997, Nature 387:733-736 showed that 

interactions between nudear receptors and a variety of nudear receptor 
co-activators are mediated by a short amino add sequence in the nuclear 
receptor co-activators having the amino add sequence LXXLL, where L 
is leucine and X represents any amino add. Accordingly, the present 

35 invention can be practiced with a binding portion of a nuclear receptor 
co-activator, provided that the binding portion contains the amino add 
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sequence LXXLL. Therefore, the present invention includes a method of 
identifying an agonist of a nuclear receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 
5 (b) a binding portion of a nuclear receptor co-activator, 

where the binding portion contains the amino acid sequence LXXLL, 
and where the binding portion is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
10 nuclear receptor; 

imder conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the binding portion of the nuclear receptor 
co-activator will take place; and 
15 (d) measuring fluorescence resonance energy transfer 

(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

In a particular embodiment, the nuclear receptor co- 
20 activator is selected from the group consisting of: human RIP- 140, 

human SRC-1, mouse TIF-2, human or mouse CBP, human or mouse 
p300, mouse TIF-1, and human TRIP proteins. 

In a particular embodiment, the nuclear receptor co- 
activator is human RIP- 140 and the binding portion includes a 
25 contiguotis stretch of amino adds of human RIP- 140 selected fi*om the 
group consisting of: positions 20-29, 132-139, 184-192, 266-273, 379-387, 
496-506, 712-719, 818-825, 935-944, and 935-942. 

In another embodiment, the nuclear receptor co-activator is 
human SRC-1 and the binding portion includes a contiguous stretch of 
30 amino adds of human SRC-1 selected firom the group consisting of: 
positions 45-53, 632-640, 689-696, 748-755, and 1434-1441. 

In another embodiment, the nudear receptor co-activator is 
mouse TIF-2 and the binding portion includes a contiguous stretch of 
amino adds of mouse TIF-2 selected firom the group consisting of: 
35 positions 640*650, 689-699, and 744-754. 
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In another embodiment^ the nuclear receptor co-activator is 
human or mouse CBP and the binding portion includes a contiguous 
stretch of amino adds of human or mouse CBP selected from the group 
consisting of: positions 68-78 and 356-366. 
S In another embodiment^ the nuclear receptor co-activator is 

human or mouse p300 and the binding portion includes a contiguous 
stretch of amino adds of human or mouse p300 selected from the group 
consisting of: positions 80-90 and 341-351. 

In another embodiment, the nuclear receptor co-activator is 
10 mouse TIF-1 and the binding portion includes a contiguous stretch of 
amino adds of mouse TIF-1 containing positions 722-732. 

In another embodiment, the nuclear receptor co*activator is 
human TRIP2 and the binding portion includes a contiguous stretch of 
amino adds of human TRIP2 containing positions 23-33. 
15 In anotiier embodiment, the nuclear receptor co-activator is 

hviman TRIPS and the binding portion includes a contiguous stretch of 
amino acids of human TRIPS containing positions 97-107, 

In another embodiment, the nuclear receptor co-activator is 
human TRIP4 and the binding portion includes a contiguous stretch of 
20 amino adds of human TRIP4 containing positions 36-46. 

In another embodiment, the nuclear receptor co-activator is 
htunan TRIP5 and the binding portion includes a contiguous stretch of 
amino adds of human TRIPS containing positions 26-36. 

In another embodiment, the nudear receptor co-activator is 
25 httman TRIPS and the binding portion includes a contiguous stretch of 
amino adds of human TRIPS containing positions 36-46. 

In another embodiment, the nuclear receptor co-activator is 
himian TRIP9 and the binding portion includes a contiguous stretch of 
amino adds of human TRIP9 selected from the group consisting of: 
30 positions 73-83, 256-266 and 288-298. 

For amino add sequences of nudear receptor co-activators, 
see Yao et al., 1996, Proc. Nati. Acad. Sd. USA 93:10626-10631 (SRC-1); 
0§ate et al., 1995, Sdence 270:1354-1357 (SRC-1); Cavaill&s et al., 1995, 
EMBO J. 14:3741-3751 (RIP-140); Voegel etal., 1996, EMBO J. 15:101-108 
35 (TIF-2); Kwok et al., 1994, Nature 370:223-226 (CBP); Arias et al., 1994, 
Nature 370:226-229 (CBP); Eckner et al., 1994, G^nes Dev. 8:869-884 
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(p300); Le Douarin et al., 1995, EMBO J. 14:2020.2033 (TIF-l); Lee et al., 
1995, Nature 374:91-94 (TRIP proteins). 

The particular embodiments of the present invention 
described above are all particular embodiments of a more general 
5 method that is also part of the present invention. That general method 
is a method of identi^dng an agonist of a nudear receptor that 
comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 
10 (b) a polypeptide containing the amino add sequence 

LXXLL where the polypeptide is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
nuclear receptor; 

IS under conditions such that, if the substance is an agonist of 

the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the polypeptide will take place; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

20 where the occurrence of FRET indicates that the substance 

is an agonist of the nuclear receptor. 

In a particular embodiment, the amino acid sequence 
LXXLL is present in an a helical portion of the polypeptide. In another 
embodiment^ the amino add sequence LXXLL is present in an a helical 
25 portion of the polypeptide and the leucines form a hydrophobic face. 

The present invention provides methods for identifying 
antagonists of a nuclear receptor. Such methods are based on the ability 
of the antagonist to prevent the occurrence of agonist-induced binding 
between a nudear receptor and CBP, p300, or other nudear receptor co- 
30 activator, or to disrupt such binding after it has occurred. Thus, the 
present invention provides a method for identifying antagonists of 
nuclear receptors that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 
35 (b) CBP, pSOO, or other nudear receptor co-activator, or a 

binding portion thereof, labeled with a second fluorescent reagent; 
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(c) an agonist of the nuclear receptor; and 

(d) a substance suspected of being an antagonist of the 
nuclear receptor; 

under conditions such that, in the absence of the substance, 
5 binding between the nuclear receptor or ligand binding domain thereof 
and CBP, p300, or other nuclear receptor co-activator» or a binding 
portion thereof will occur; and 

(e) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents when the 

10 substance is present and measuring FRET between the first and second 
fluorescent reagents when the substance is absent; 

where the a decrease in FRET when the substance is 
present indicates that the substance is an antagonist of the nuclear 
receptor. 

1^ In particular embodiments, the nuclear receptor is selected 

from the group consisting of steroid receptors, thyroid hormone 
receptors, retinoic add receptors, peroxisome proliferator-activated 
receptors, retinoid X receptors, glucocorticoid receptors, vitamin D 
receptors, and **orphan nuclear receptors'* such as LXR, FXR, etc. 

20 In a particular embodiment, the nuclear receptor or ligand 

binding domain thereof is a full-length nuclear receptor. In another 
embodiment, the nuclear receptor or Ugand binding domain thereof is a 
ligand binding domain of a nuclear receptor. In another embodiment, 
the nudear receptor or hgand binding domain thereof is an AF-2 site of 

25 a nudear receptor. 



mm 


1 


rrn 


• 



binding domain thereof is a full-length PPAR. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is the ligand 
binding domain of a PPAR. In a further embodiment, the PPAR is 
30 selected firom the group consisting of PPARo, PPARy, and PPAR6, In a 
further embodiment, the ligand binding domain of the PPAR contains 
amino add residues 176-478 of human PPARyl. 

In a particular embodiment, the nudear receptor or ligand 
binding domain thereof contains amino adds 143-462 of human RARa. 
35 In another embodiment, the nudear receptor or ligand binding domain 
thereof contains amino adds 122-410 of rat TgRal. In another 
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embodiment, the nuclear receptor or ligand binding domain thereof 
contains amino acids 227-463 of mouse KXR7. In another embodiment, 

the nuclear receptor or ligand binding domain thereof contains amino 
adds 251-595 of human ER. 
5 In a particular embodiment, the above-described methods 

utilize full-length CBP, either mouse or human. In other embodiments, 
the methods utilize amino add residues 1-113 of human CBP. In 
another embodiment, the methods utilize amino add residues 1-453 of 
human CBP.^ 

10 The conditions under which the methods described above 

are carried out are conditions that are typically used in the art for the 
study of protein-protein interactions: e.g,y physiological pH; salt 
conditions such as those represented by such commonly used buffers as 
PBS; a temperature of about 4®C to about 55°C, The presence of 

15 commonly used non-ionic detergents, c.^., NP-40®, sarcosyl, Triton X- 
100®, is optional. When europivmi cryptates are used as fluorescent 
reagents, reactions should contain KF at a concentration of at least 200 
mM. 

In prindple, one could measure FRET by monitoring either 

20 (a) a decrease in the emission of the donor fluorescent reagent following 
stimulation at the donor's absorption wavelength and/or (b) an increase 
in the emission of tiie acceptor reagent following stimulation at the 
donor's absorption wavelength. In practice, FRET is most efifectively 
measured by emission ratioing. Emission ratioing monitors the diange 

25 in the ratio of emission by the acceptor over emission by the donor. An 
increase in this ratio signifies that energy is being transferred from 
donor to acceptor and thus that FRET is occurring. Emission ratioing 
can be measured by employing a laser-scanning confocal microscope. 
Emission ratioing is preferably done by splitting the emitted light from a 

30 sample with a dichroic mirror and measuring two wavelength bands 
(corresponding to the donor and the acceptor emission wavelengths) 
simultaneously with two detectors. Alternatively, the emitted light can 
be sampled consecutively at each wavelength Oby using appropriate 
filters) with a single detector. In any case, these and other methods of 

35 measuring FRET are well known in tiie art. 
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Although a variety of donor and acceptor fluorescent 
reagents can be used in the practice of the present invention, preferred 
embodiments of the present invention make use of cryptates of 
fluorescent reagents as donor reagents. Inclusion of a substrate into the 

5 intramolecular cavity of a macropolycydic ligand results in the 
formation of a cryptate. The macropolycydic ligand shields the 
substrate from interaction with solvent and other solute molecules. If 
the substrate is a fluororescent reagent, formation of a cryptate may 
result in markedly different spectroscopic characteristics for the reagent 

10 as compared to the spectroscopic characteristics of the free reagent. 

The present invention includes the use of europium (EuHI) 
or terbiiun (TbHI) cryptates as donor fluorescent reagents. Such EuDI or 
TbUI cryptates, as well as methods for their formation, are well known 
in the art. For example, see Alpha et al., 1987, Angew. Chem. Int. Ed. 

15 Engl. 26:266-267; Mathis, 1995, CHn. Chem, 41:1391-1397. A europium 
cryptate is formed by the inclusion of a eiuropiiim ion into the 
intramolecular cavity of a macropolycydic ligand which contains 
bipyridine groups as light absorbers. When europium cryptates are 
present in solution together with fluoride ions, a total shielding of the 

20 europium cryptate fluorescence is occurs. The molecular structure of a 
europium cryptate is shown below. 



NH2 NH2 

C2H4 92*^^ 
NH NH 
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Europiiuu cryptates can be conjugated to proteins by the use 
of well-known heterobifunctional reagents (see, e.g. , International 
Patent Application WO 89/05813; Prat et al., 1991, Anal. Biochem. 
195:283-289; Lopez et al., 1993, Clin. Chem. 39:196-201). 
5 The present invention includes the use of XL665 as the 

acceptor fluorescent reagent. XL665 is a crosslinked derivative of 
allophycocyanin (APC). APC is a porphyrin containing protein which is 
derived from the light harvesting system of algae (Kronick, 1986, M. 
Immunol. Meth. 92:1-13). XL665 has an absorption maximum at =620 

10 nm and an emission maximum at 665 nm. In some embodiments of the 
invention, XL665 is labeled witti streptavidin in order to effect the 
binding of the streptavidin-labled XL665 to a biotin-labeled substance, 
e.g. , CBP or the ligand binding domain of a nuclear receptor. 
Streptavidin labehng of XL655 and biotin labeling of CBP, or the ligand 

15 binding domain of a nuclear receptor, can be performed by well known 
methods. 

In a preferred embodiment of the invention, XL665 as the 
acceptor fluorescent reagent is combined with Europium cryptate 
(Eu3+K) as the donor fluorescent reagent. Etiropium cryptate (Eu3+K) 
20 has a large Stokes shift, absorbing Hght at 337 nm and emitting at 620 
ran. Thus, the emission maximum of Europium cryptate (Eu3-t-K) 
overlaps tiie absorption maximum of XL665. Europium cryptate 
(Eu3+K) has a large temporal shift; the time between absorption and 
emission of a photon is about 1 millisecond. This is advantageous 
because most background fluorescence signals in biological samples are 
short-Uved. Thus the use of a fluorescent reagent such as europium 
ciyptate, with a long fluorescent lifetime, permits time-resolved 
detection resiilting in the reduction of background interference. 

The spectiral and temporal properties of europium cryptate 
30 (Eu3+K) result in essentially no fluorescence background and thus 
assays using this fluorescent reagent can be carried out in a "mix and 
read" mode, greatiy facilitating its use as a high throughput screening 
tool. For the embodiment using Europium cryptate (Eu3+K) and XL665, 
the measuring instrument irradiates the sample at 337 nm and 
measures the fluorescence output at two wavelengths, 620 nm (B counts, 
europium fluorescence) and 665 nm (A counts, XL665 fluorescence). 
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The extent of flurorescent resonance energy transfer is measured as the 
ratio between these two values. Typically this ratio is multiplied by 
10,000 to give whole numbers. 

Other FRET donor-acceptor pairs are suitable for the 

S practice of the present invention. For example, the following donor- 
acceptor pairs can be used: dansyl/fluorescein; fluorescein/rhodamine; 
tryptophan/aminocoumarin. 

The present invention provides a nuclear receptor or ligand 
binding domain thereof labeled with a fluorescent reagent for use in the 

10 above-described methods of identifying agonists and antagonists of 
nuclear receptors. The present invention also provides CBP, p300, or 
other nuclear receptor co-activator, or a binding portion thereof, labeled 
with a fluorescent reagent. 

In a particular embodiment, the nuclear receptor or ligand 

15 binding domain thereof is selected from the group consisting of PPAEa, 
PPARy, PPAR5, a ligand binding domain of PPARa, PPARy, or PPAR5, 
and amino add residues 176-478 of human PPARyl and the fluorescent 
reagent is selected from the group consisting of XL665 and Europium 
cryptate (EuB'f K). 

20 In a particular embodiment, GBP, p300, or other nuclear 

receptor co-activator is labeled with a fluorescent reagent selected from 
the group consisting of XL665 and Eiiropium cryptate (Eu3+K). 

The following non-limiting examples are presented to better 
25 illustrate the invention. 

EXAMPLE 1 

Cloning- expression, and purification of human CBP and PPAR 

30 To test whether human CBP can interact with PPARs in an 

agonist-dependent manner, we cloned the human cDNA fragments 
encoding the NH2-terminal 1-113 amino adds (hCBPl-113) and 1-453 

amino adds (hCBPl-453) of htiman GBP by the polymerase diain 
reaction (PGR). The DNA and amino add sequences of human CBP are 
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disclosed in Borrow et al., 1996, Nature Genet. 14:33-41 and in GenBank, 
accession no. U47741. 

The primers used for hCBPl-113 were: 

5'-ACTCGGATCCAAGCCATGGCTGAGAACTTGCTGGACGG-3^ 
5 (SEQ.ID.N0.:9) and 

5'-CACAAAGCTTAGGCCATGTTAGCACTGTTCGG-3' (SEQ JD.NO. : 
10). 

These primers were expected to amplify a 0.9 kb DNA fragment. 

The primers for hCBPl-453 were: 

10 5'-ACTCGGATCCAAGCCATGGCTGAGAACTTGCTGGACGG-3' 
(SEQ.ID.N0.:9) and 

5'CTCAGTCGACTTATTGAATTCCACTAGCTGGAGATCC-3* 
(SEQ.ID.N0.:11). 

These primers were expected to amplify a 1.5 kb DNA fragment.. 

15 The template for the PGR reaction was a human fetal brain 

cDNA library (Stratagene, Catalogue #IS 937227). Of coin-se, any 
human cDNA library from a tissue expressing CBP could have been 
used. The PGR amplified 0.9 kb and 1.5 kp DNA fragments which were 
digested with restriction endonucleases and ligated into pBluescript II 

20 vector. DNA sequencing analysis confirmed that the amplified 

fragments were identical to the corresponding published nucleic add 
sequences of himian GBP, 

Based on the publicly available sequences for human GBP 
dted above » other primers could be readily identified and prepared by 

25 those skilled in the art in order to amplify and clone other portions of 
cDNA encoding human CBP from appropriate cDNA libraries. Once 
such portions of human GBP are produced, they could be used in the 
methods of the present invention in a manner similar to that described 
herein for hGBPl-113 and hGBPl-453. The amino add sequence of 

30 himian GBP is shown in Figure 7A; the nudeic add sequence of the 
cDNA encoding human GBP is shown in Figure 7B. 

To express the polypeptides encoded by the PGR fragments, 
vectors encoding fiision proteins of the polypeptides and glutathione S- 
transferase (GST) were constructed and expressed in E. colL The PGR 

35 fragments were subdoned into the expression vector pGEX (Pharmada 
Biotech) to generate pGEXhGBPl.113 and pGEXhGBPl-453. 
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pGEXhCBPl-113 and pGEXhCBPl-453 were transfected into the DH5a 
strain ofE. coli (GIBCO BRL) and the bacteria hosting either 
pGEXhCBPl-113 or pGEXhCBPl-453 were cultured in LB medium 
(GIBCO BRL) to a density of OD600 = 0.7-1.0 and induced for 

5 overexpression of the GST-CBP fusion proteins by addition of IPTG 
(isopropylthio-^-galactoside) to a final concentration of 0.2 mM. The 

IPTG induced cultures were further grown at room temperature for 2-5 
hrs. The cells were harvested by centrifugation for 10 min at SOOOg. The 
cell pellet was used for GST-CBP fusion protein purification by following 

10 the procedure from Pharmacia Biotech using Glutathione Sepharose 
beads. hCBPl-113 and hCBPl-453 proteins were generated by cleaving 
the corresponding GST fusion proteins with thrombin. SDS- 
polyacrylamide gel electrophoresis analysis showed that the preparation 
firom pGEXhCBPl-113 gave two polypeptide bands^ with apparent 

IS molecular weight of 12 kd and 10 kd. The 12 kd band is the expected size 
of hCBPl-113 and the 10 kd band is most likely a premature translational 
termination product. The preparation firom pGEXhCBP 1-450 gave a 

single band with the expected size, 50 kd. 

cDNAs encoding, full-length PPARa and PPARyl were 

20 subdoned into pGEX vectors for the production of GST-PPARa and GST- 
PPARyI fusion proteins in Exoli. PPARyl was cloned firom a human fat 
cell cDNA library (see Elbrecht et al.» 1996» Biochem. Biophys. Res. 
Comm. 224:431-437). A cDNA encoding the hvunan PPARyI ligand 
binding domain (PPARyILBD; amino adds 176-478 of PPARyl) was 

25 subdoned from a modified pSG5 vector as a Xho I (site located in the N- 
terminus of the LBD)/ Xba I (site located in the pSGrS vector) firagment. 
The Xba I site was bluntrended with T4 DNA polymerase. The 1.1 kb 
firagment containing the LBD was purified from an agarose gel and 
ligated into pGEX-KG (see Guan & Dixon, 1991, Ansl. Biochem. 192:262- 

30 267) that had been digested with Xho 1 and Hind III (the Hind IH site 
had been blunt-ended with T4 DNA poljnnerase). This construct was 
used for the production of GST-hPPARylLBD and hPPARvlLBD (the 

ligand binding domain cleaved firee of GST). The overexpression and 
purification of PPARa, PPARyl, and PPARyILBD were as described 

35 above for GBP. 
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The DNA and amino acid sequences of human PPARa are 
disclosed in Schmidt et al., 1992, MoL Endocrinol. 6:1634-1641 and in 
GenBank, accession no. L07592. See Figure 8A and 8B. 

The DNA and amino acid sequences of human PPARyl are 

5 disclosed in Greene et al., 1995, Gene Expr. 4:281-299; Qi et al., 1995, Mol. 
Cell. BioL 15:1817-1825; Elbrecht et al., 1996, Biochem. Biophys. Res. 
Comm. 224:431-437; and in GenBank, accession no. L40904. See Figure 
9A and 9B. Human PPARy2 contains the same amino add sequence as 
human PPARyl except for an amino terminal addition of 24 amino adds 

10 (see Elbrecht et al., 1996, Biochem. Biophys. Res. Comm. 224:431-437). 
Thus, the amino add sequence of the ligand binding domain of human 
PP ARy2 is the same as the amino add sequence of the ligand binding 
domain of human PPARyl, although the nimabering of the amino adds 
differs (176-478 for human PPARyl and 200-502 for human PPAR72). 

15 The DNA and amino add sequences of hiunan PPAR5 are 

disclosed in Sher et al., 1993, Biochemistry 32:5598-5604 and in GenBank, 
accession no. L02932. See Figure lOA-C. 

EXAMPLE 2 

20 Interaction between PPARs and hCBP fragments 

Experiments were first conducted using hCBPl-113 and 
hPPARylLBD^ Purified hPPARylLBD was biotinylated with Sulfo-NHS- 
LC-Biotin (PIERCE) to a biotin:hPPARYlLBD ratio of 3:1 according to the 
procedure provided by PIERCE. Purified hCBPl-113 was directly labeled 

25 with europiiun cryptate (Eu3+K) by the method illustrated in Figure 1. 
Biotin-labeled hPPARylLBD, Eu3+K-labeled hCBPl-113, and 
streptavidin-labeled XL665 (SA-XL665; firom PACKARD) were incubated 
together in the presence or absence of 1 ^M of known PPARy agonist 
(BRL49653 or AD5075)« 

30 Thus, this experimental format made use of the fluorescent 

reagent pair europiiun cryptate (Eu3+K), which acted as donor, and 
XL665, which acted as acceptor. hCBPl-113 was directly labeled with 
europium cryptate (Eu3+K); hPPARylLBD was indirectly labeled with 

XL665 by means of a biotin-streptavidin link. The emission maximum 
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of europium cryptate (Eu3+K) overlaps with the absorption maximxmi of 
XL665. Therefore, when europium cryptate (Eu3+K) and XL665 are in 
dose proximity, and the sample is illuminated with light at 337 nm (the 
absorption maximum of europium cryptate (Eu3+K)), FRET can occur 
5 between europium cryptate (Eu3+K) and XL665. This FRET manifests 
itself as increased emission at 665 nm by XL665* Figure 2 shows a 
schematic of the format used in this experiment (experiment 1 of Table 
1). When agonist is bound to hPPARylLBD, a specific interaction occurs 
between hPFARylLBD and hCBPl-113, thus bringing europium cryptate 

10 (Eu3+K) and XL665 into close enough proximity for FRET to occur. In 
the absence of agonist, no interaction occurs between hPFARylLBD and 
hCBPl-113 and thus europium cryptate (Eu3+K) and XL665 are not 
brought into dose proximity and no FRET occurs. When FRET occurs, 
the amount of light given off by the sample at the emission mfliriTn^im of 

15 XL665 (665 nm) is increased relative to the amount of light given off by 
the sample at the emission maximum of europium cryptate (Eu3-f K) 
(620 nm). Therefore, measuring the ratio of emission at 665 nm to 620 
nm in the presence and the absence of a substance suspected of being an 
agonist allows for the determination of whether that substance actually 

20 is an agonist. If the substance is an agonist, an increase in the ratio of 
emission at 665 nm to 620 nm in the presence of the substance wiU be 
observed. 

Reactions were carried out in microtiter plates. Reaction 
conditions were: appropriate volume (total 250 jd) of the reaction buffer 

25 (either PBS or HEPES, see below, containing 500 mM KF, 0.1% bovine 
serum albumin, BSA) was added to each well, followed by addition of 
ligands (BRL49653 or AD5075 at a final concentration of 1 ^iM and 0.1% 
dimethylsulfoxide (DMSO) or vehide control (0.1% DMSO), Eu3+K 
labeled hCBP (100 nM), biotin-hPPARylLBD (100 nM), and streptavidin- 

30 labeled XL665 (100 nM) to appropriate wells. After mixing, 200^1of 
reaction mixture was transferred to a new well. The plate was either 
directly measured for fluorescence resonance energy transfer (FRET) or 
covered with sealing tape (PACKARD) to avoid evaporation and 
incubated at room temperature for up to 24 hrs before measuring FRET. 

35 The results of this experiment and others described below 

yielded ratio values as follows: 
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Table 1 



jiixpcriiucnL 


XjUIIcx 


rimiooioii rciiiio 

with AD5075 


JjiniibslUli rctbiil 

witii vehicle 


1 


PBS 


1134 


1074 


2 


HEPES + 0.05% 
NP40 


967 


617 


3 


HEPES + 0.05% 
NP40 


1078 


536 


4 


HEPES + 0.05% 
CHAPS 


1883 


487 
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Experiment 1 of Table 1 was carried out using PBS (137 mM 
NaCl, 2.7 mM KCl, 4.3 mM Na2HP04, 1.4 mM KH2PO4, pH 7.4). The 

greater emission ratio observed in the presence of AD5075 demonstrated 
that a specific interaction between hCBPl-113 and hPPARYlLBD 

5 occurred in the presence of the agonist AD5075. Although it was clear 
that FRET was occurring, the signal-noise ratio was small. In 
experiment 2 of Table 1, HEPES buflfer (N-2-hydroxyethylpiperazine-N'- 
2-ethane sulfonic acid, 100 mM, pH 7.0) containing 0.05% NP40 (Nonidet 
P-40) was used instead of PBS and an improved signal-noise ratio was 
10 obtained. 

In order to get an even better signal-noise ratio, the above- 
described format was modified slightly for experiment 3. In experiment 
3, SA-XL665 (500 nM), biotin-labeled hPFARylLBD (100 nM), GST- 
hCBPl-113, and Eu3+K labeled anti-GST antibody (2.5 jil) were incubated 

15 in the presence or absence of AD5075 (1 jiM) in HEPES buffer containing 
0.05% NP40. A two-fold signal- noise ratio was obtained. Figure 3 shows 
a schematic of the format used in experiment 3. 

The anti-GST antibody was a goat antibody to GST fix)m 
Pharmacia (catalogue number 27-4577-01) that was labeled with Eu3+K 

20 according to the procediire simunarized below. 

-Make up@ 10 
mg/mL in H20. 
Need 42,2 pg (4.2 
pL, 96.6 nmol) for 
49.0 pg Eu3+ 
Reagent 




-Resuspended @ -FW = 1465 2.9 Equiv SULFO-SMCC. 

2.5 mg/mL in 10% Use 49.0 pg 20 mM Pi buffer, 10% DMF 

DMF/PBS (19.6 pL, 33.4 nmol) RT, 30 minutes 
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5 equiv. Eu3+ 
complex 




Anti-GST Antibody, 
Cat #27^577-01 



1 ) Add Eu reagent 

2) [Pr] 4.2 mg/mL. 
Onight at 4'*C. 

Lower pH: Add 12 pL of 
IM NaPi, pH 7.0. 
pH drops to 7.18. 

350 pM TCEP 

(35 mM stock is 

10.0 mg/mLr PBS, pH 

7.0), 2.4 pL, 

15 min it then 15 min @ 

4°C 



Anti-GST Antibody. 
Cat #27-4577.01 



NH 




From Pharmacia, 5.0 mg/mL, 
FW = 150 kD Use 200 pL (1 mg, 
6.66 nmol) exchange into 10 mM 
Borate, 350 mM NaQ, 10% Gly, 
pH 8.5 with BioSpin-30 



5.0 Equiv SPDP, 
RT, 5 hours 



FW =312, Dissolve 
@ 1.00 mg/mL in EtOH. 
Add 10.4 pL (5 equiv., 
10.4 pg, 33.4 nmol) to 
protein. 



10 



To further improve the signal to noise ratio, a series of - 
experiments were conducted. Experiment 4 of Table 1 exemplifies 
results obtained firom those efforts. cDNA encoding a longer fi-agment of 
hCBP was cloned and expressed to get hCBPl-453. hCBPl-453 was 
biotinylated. Biotin-labeled hCBPl-453 (25 nM), SA-XL665 (100 nM), 
GST-hPPARYlLBD (1 nM), and Eu3+K-labeled anti-GST antibody (2 nM) 
were mixed together in the presence or absence of 1 ^M AD5075. The 
detergent was changed firom 0.05% NP40 to 0.5% CHAPS (3-{[3- 

cholamidopropyl]dimethyl-ammoniol}-l-propanesulfonate). A three- to 
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four-fold signal-noise ratio was obtained. Pigiire 4 shows the strategy 
used for experiment 4 and similar experiments. 

The correlation between results from the above-described 
assays and previously reported results from in vitro binding and 
5 transcriptional activation assays of selected antidiabetic insulin 

sensitizers that are known to be PPARy agonists (Elbrecht et al., 1996, 
Biochem Biophys Res Comm 224:431-437) was analyzed by titrating those 
known PPARy agonists in the assays described above and comparing 
EC50S so obtained with previously described values for potency in 

10 binding or transcriptional activation assays for the known agonists. The 
results are shown in Figure 5. From Figure 5, the following ECsQs can 

be derived: 

AD5075 = 8 nM 

BRL49653 = 53 nM 

15 TrogUtazone = 646 nM 

HogUtazone = 890 nM. 
These EC5OS generated in the above-described assays are in close 

agreement with those generated by in vitro binding and transcriptional 
activation studies (Elbrecht et al., 1996, Biochem Biophys Res Comm 
20 224:431-437). 

The above-described assay can also be used to characterize 
the interaction between nuclear receptors with co-activators as, e.g. , by 
determining the binding constant for that interaction. Figure 6 shows 
an example of such an application. Saturating amounts of PPARy 

25 agonist (10 ^M BRL49653) were used. Increasing concentrations of non- 
biotinylated hCBPl-453 were used to titrate away biotin-hCBP- 
PPARylLBD complex and decrease the fluorescence energy transfer. A 
Kd of 300 nM for the interaction between hCBPl-453 and PPARylLBD 
can be derived from tiie results illustrated in Figure 6 and this Kd (300 

30 nM) is a measurement ofthe afiEinity between CBP and PPARy. 

The present invention is not to be limited in scope by the 
specific embodiments described herein. Indeed, various modifications 
of the invention in addition to those described herein will become 
apparent to those skilled in the art firom the foregoing description. Such 

35 modifications are intended to fall within the scope ofthe appended 
claims. 
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WHAT IS CLAIMED: 

1. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
5 labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; and 

(c) a substance suspected of being an agonist of the 
nuclear receptor; 

10 imder conditions such that, if the substance is an agonist of 

the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and CBP, p300, or other nuclear receptor co- 
activator, or a binding portion thereof, will occur; and 

(d) measuring fluorescence resonance energy transfer 
IS (FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

2. The method of claim 1 where the nuclear receptor or 
20 ligand binding domain thereof is selected from the group consisting of 

steroid receptors, thyroid hormone receptors, retinoic add receptors, 
peroxisome proliferator-activated receptors, retinoid X receptors, 
glucocorticoid receptors, vitamin D receptors, LXR, and FXR. 

25 3. The method of daim 1 where the nuclear receptor or 

Ugand binding domain tiiereof is selected firom the group consisting of a 
full-length nudear receptor, a ligand binding domain of a nuclear 
receptor, and an AF-2 site of a nudear receptor. 

30 4. The method of daim 1 where the nudear receptor or 

ligand binding domain thereof comprises an AF-2 site of a nuclear 
receptor. 

5. The method of daim 1 where the nuclear receptor or 
35 Ugand binding domain thereof is selected firom the group consisting of a 
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full-length PPAR, a ligand binding domain of a PPAR, and amino acid 
residues 176-478 of human PPARyl. 

6. The method of claim 1 where the nuclear receptor or 
5 ligand binding domain thereof is selected from the group consisting of 

PPARa, PPARyl, PPAR72, and PPAR5. 

7. The method of claim 1 where the nuclear receptor or 
ligand binding domain thereof comprises a ligand binding domain 

10 selected from the group consisting of amino acids 143-462 of human 
RARa, amino adds 122-410 of rat TaRal, amino acids 227-463 of mouse 
RXRy, and amino adds 251-595 of human £R. 

8. The method of claim 1 where CBP, p300, or other 
15 nuclear receptor co-activator, or a binding portion thereof is selected 

from the group consisting of full-length human CBP, full-length mouse 
CBP. amino add residues 1-113 of human CBP, and amino acid residues 
1-453 of hxmian CBP. 

20 9. The method of claim 1 where the first fluorescent 

reagent is selected from the group consisting of XL665 and Europium 
cryptate (Eu3-i-K). 

10. The method of claim 1 where the second fluorescent 
25 reagent is selected from the group consisting of XL665 and Europium 

cryptate (Eu3+K). 

11. A method of identifying an agonist of a nudear 
receptor that comprises providing: 

30 (a) a nudear receptor or Ugand binding domain thereof 

labeled with a first fluorescent reagent; 

Ot>) a binding portion of a nudear receptor co-activator, 
where the binding portion contains the amino add sequence LXXLL, 
and where the binding portion is labeled with a second fluorescent 

35 reagent; and 
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(c) a substance suspected of being an agonist of the 
nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
5 binding domain thereof and the binding portion of the nuclear receptor 
co-activator will take place; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FR£T indicates that the substance 
10 is an agonist of the nuclear receptor. 

12. The method of claim 11 where the binding portion of a 
nuclear receptor co-activator is selected from the group consisting of 
human RIP-140, human SRC-1, mouse TIF-2, human or mouse CBP, 

15 human or mouse p300^ mouse TIF-l, and human TRIP proteins. 

13. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
20 labeled with a first fluorescent reagent; 

(b) a polypeptide containing the amino add sequence 
LXXLL where the polypeptide is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
25 nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the polypeptide will take place; and 

(d) measuring fluorescent resonance energy transfer 
30 (FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

14. A method for identifying an antagonist of a nuclear 
35 receptor that comprises providing: 
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(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof^ labeled with a second fluorescent reagent; 

5 (c) an agonist of the nuclear receptor; and 

(d) a substance suspected of being an antagonist of the 
nuclear receptor; 

under conditions such that, in the absence of the substance, 
binding between the nuclear receptor or ligand binding domain thereof 
10 and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof will occur; and 

(e) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents when the 
substance is present and measuring FRET between the first and second 

IS fluorescent reagents when the substance is absent; 

where the a decrease in FRET when the substance is 
present indicates that the substance is an antagonist of the nuclear 
receptor. 

20 15. The method of claim 14 where the nuclear receptor or 

ligand binding domain thereof is selected from the group consisting of 
steroid receptors, thyroid hormone receptors, retinoic add receptors, 
peroxisome proliferator-activated receptors, retinoid X receptors^ 
glucocorticoid receptors^ vitamin D receptors, LXR, and FXR. 

25 

16. The method of daim 14 where the nudear receptor or 
ligand binding domain thereof is selected firom the group consisting of a 
full-length nuclear receptor, a ligand binding domain of a nudear 
receptor, and an AF-2 site of a nuclear receptor. 

30 

17. The method of daim 14 where the nudear receptor or 
ligand binding domain thereof comprises an AF-2 site of a nuclear 
receptor. 

« 

35 18. The method of daim 14 where the nuclear receptor or 

ligand binding domain thereof is selected from the group consisting of a 
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full-length PPAR, a ligand binding domain of a PPAR, and amino acid 
residues 176-478 of human PPARyl. 

19. The method of daim 14 where the nuclear receptor or 
5 ligand binding domain thereof is selected from the group consisting of 

PPARa, PPARyl, PPARy2, and PPAR5. 

20. The method of daim 14 where the nudear receptor or 

Ugand binding domain thereof comprises a ligand binding domain 

10 selected from the group consisting of amino adds 143-462 of htiman 
RARa» amino adds 122-410 of rat TsRal, amino adds 227-463 of mouse 

RXRy, and amino adds 251-595 of human ER. 

21. The method of daim 14 where CBP, p300, or other 
IS nudear receptor co-activator, or a binding portion thereof is selected 

from the group consisting of full-length CBP, amino add residues 1-113 
of himian CBP, and amino add residues 1-453 of human CBP. 

22. The method of claim 14 where the first fluorescent 
20 reagent is selected from tiie group consisting of XL665 and Europium 

cryptate (Eu3+K). 

23. The method of daim 14 where the second fluorescent 
reagent is selected from the group consisting of XL665 and Europium 

25 cryptate (Eu3+K). 

24. A nudear receptor or Ugand binding domain thereof 
labeled with a fluorescent reagent. 

30 25. The nudear receptor or ligand binding domain 

thereof of claim 24 where the nudear receptor or ligand binding domain 
thereof is selected from the group consisting of PPARa, PPARyl, 
PPARy2. PPAR5, a Ugand binding domain of PPARa, PPARyl, PPAR72, 
or PPARS, and amino add residues 176-478 of human PPARyl and the 

35 fluorescent reagent is selected from the group consisting of XL665 and 
Europitim cryptate (Eu3-i-K). 

-33- 



BNSOOCID: <WO__99l8124Al_L> 



wo 99/18124 



PCT/US98/21049 



26. CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a fluorescent reagent. 

5 27. The CBP, p300, or other nuclear receptor co-activator, 

or a binding portion thereof, of claim 26 where the fluorescent reagent is 
selected from the group consisting of XL665 and Europitun cryptate 
(Eu3+K). 
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1 MAENLLDGPPNPKRAKLSSPGFSANDSTDFGSLFDLENDLPDELIPNGGELGLLNSGNLV 
61 PDAASKHKQLSELLRGGSGSSI NPGIGNVSASSPVQQGL6GQAQGQPNSANMASLSAMGK 
121 SPLSQ6DSSAPSLPKQAASTSGPTPAASQALNPQAQKQVGLATSSPATSQTGPGICMNAN 
181 FNQTHPGLLN5NSGHSLINQASQGQAQVMN6SLGAAGRGRGAGMPYPTPAMQGASSSVLA 
241 ETLTQVSPQMTGHAGLNTAQAGGMAKMGIT6NTSPFGQPFSQAGGQPMGATGVNPQLASK 
301 QSMVNSLPTFPTDIKNTSVTNVPNMSQMQTSV6IVPTQAIAT6PTADPEKRKLIQQQLVL 
361 LLHAHKCQRREQANGEVRACSLPHCRTMKNVLNHMTHCQAGKACQ 

FIG.7A 



1 cgagccccga cccccgtccg ggccctcgcc ggccgcgccg cccgtgcccg gggctgtttt 
61 cccgagcagg tgaaaatggc tgagaacttg ctggacggac cgcccaaccc caaaagagcc 
121 aaactcagct cgcccggttt ctcggcgaat gacagcacag attttggatc attgtttgac 
181 ttggaaaatg atcttcctga tgagctgata cccaatggag gagaattagg ccttttaaac 
241 agtgggaacc ttgttccaga tgctgcttcc aaacataaac aactgtcgga gcttctacga 
301 ggaggcagcg gctctagtat caacccagga ataggaaatg tgagcgccag cagccccgtg 
361 cagcagggcc tgggtggcca ggctcaaggg cagccgaaca gtgctaacat ggccagcctc 
421 agtgccatgg gcaagagccc tctgagccag ggagattctt cagcccccag cctgcctaaa 
481 caggcagcca gcacctctgg gcccaccccc gctgcctccc aagcactgaa tccgcaagca 
541 caaaagcaag tggggctggc gactagcagc cctgccacgt cacagactgg acctggtatc 
601 tgcatgaatg ctaactttaa ccagacccac ccaggcctcc tcaatagtaa ctctggccat 
661 agcttaatta atcaggcttc acaagggcag gcgcaagtca tgaatggatc tcttggggct 
721 gctggcagag gaaggggagc tggaatgccg taccctactc cagccatgca gggcgcctcg 
781 agcagcgtgc tggctgagac cctaacgcag gtttccccgc aaatgactgg tcacgcggga 
841 ctgaacaccg cacaggcagg aggcatggcc aagatgggaa taactgggaa cacaagtcca 
901 tttggacagc cctttagtca agctggaggg cagccaatgg gagccactgg agtgaacccc 
961 cagttagcca gcaaacagag catggtcaac agtttgccca ccttccctac agatatcaag 
1021 aatacttcag tcaccaacgt gccaaatatg tctcagatgc aaacatcagt gggaattgta 
1081 cccacacaag caattgcaac aggccccact gcagatcctg aaaaacgcaa actgatacag 
1141 cagcagctgg ttctactgct tcatgctcat aagtgtcaga gacgagagca agcaaacgga 
1201 gaggttcggg cctgctcgct cccgcattgt cgaaccatga aaaacgtttt gaatcacatg 
1261 acgcattgtc aggctgggaa agcctgccaa 

FIG.7B 
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1 MVDTESPLCPLSPLEAGDLESPLSEEFLQEMGNIQEISQSIGEDSSGSF6FTEYQYLGSC 
61 PGSDGSVITDTLSPASSPSSVTYPVVPGSVDESPSGALNIECRICGDKASGYHYGVHACE 
121 6CKGFFRRTIRLKLVYDKCDRSCKIQKKNRNKCQYCRFHKCLSVGMSHNAIRFGRMPRSE 
181 KAKLKAEILTCEHDIEDSETADLKSLAKRIYEAYLKNFNMNKVKARVILSGKASNNPPFV 
241 IHDMETLCMAEKTLVAKLVANGIQNKEVEVRIFHCCQCTSVETVTELTEFAKAIPAFANL 
301 DLNDQVTLLKYGVYEAIFAMLSSVMNKDGMLVAYGNGFITREFLKSURKPFCDIMEPKFD 
361 FAMKFNALELDOSDISLFVAAI ICCGDRPGLLNVGHIEKMQEGIVHVURLHLQSNHPDDI 
421 FLPKLLQKMADLRQLVTEHAQLVQI IKKTESDAALHPLLQEI YRDMY 

FIG.8A 

1 ggcccaggct gaagctcagg gccctgtctg ctctgtggac tcaacagttt gtggcaagac 
61 aagctcagaa ctgagaagct gtcaccacag ttctggaggc tgggaagttc aagatcaaag 
121 tgccagcaga ttcagtgtca tgtgaggacg tgcttcctgc ttcatagata agagtagctt 
181 ggagctcggc ggcacaacca gcaccatctg gtcgcgatgg tggacacgga aagcccactc 
241 tgccccctct ccccactcga ggccggcgat ctagagagcc cgttatctga agagttcctg 
301 caagaaatgg gaaacatcca agagatttcg caatccatcg gcgaggatag ttctggaagc 
361 tttggcttta cggaatacca gtatttagga agctgtcctg gctcagatgg ctcggtcatc 
421 acggacacgc tttcaccagc ttcgagcccc tcctcggtga cttatcctgt ggtccccggc 
481 agcgtggacg agtctcccag tggagcattg aacatcgaat gtagaatctg cggggacaag 
541 gcctcaggct atcattacgg agtccacgcg tgtgaaggct gcaagggctt ctttcggcga 
601 acgattcgac tcaagctggt gtatgacaag tgcgaccgca gctgcaagat ccagaaaaag 
661 aacagaaaca aatgccagta ttgtcgattt cacaagtgcc tttctgtcgg gatgtcacac 
721 aacgcgattc gttttggacg aatgccaaga tctgagaaag caaaactgaa agcagaaatt 
781 cttacctgtg aacatgacat agaagattct gaaactgcag atctcaaatc tctggccaag 
841 agaatctacg aggcctactt gaagaacttc aacatgaaca aggtcaaagc ccgggtcatc 
901 ctctcaggaa aggccagtaa caatccacct tttgtcatac atgatatgga gacactgtgt 
961 atggctgaga agacgctggt ggccaagctg gtggccaatg gcatccagaa caaggaggtg 
1021 gaggtccgca tctttcactg ctgccagtgc acgtcagtgg agaccgtcac ggagctcacg 
1081 gaattcgcca aggccatccc agcgttcgca aacttggacc tgaacgatca agtgacattg 
1141 ctaaaatacg gagtttatga ggccatattc gccatgctgt cttctgtgat gaacaaagac 
1201 gggatgctgg tagcgtatgg aaatgggttt ataactcgtg aattcctaaa aagcctaagg 
1261 aaaccgttct gtgatatcat ggaacccaag tttgattttg ccatgaagtt caatgcactg 
1321 gaactggatg acagtgatat ctcccttttt gtggctgcta tcatttgctg tggagatcgt 
1381 cctggccttc taaacgtagg acacattgaa aaaatgcagg agggtattgt acatgtgctc 
1441 agactccacc tgcagagcaa ccacccggac gatatctttc tcttcccaaa acttcttcaa 
1501 aaaatggcag acctccggca gctggtgacg gagcatgcgc agctggtgca gatcatcaag 
1561 aagacggagt cggatgctgc gctgcacccg ctactgcagg agatctacag ggacatgtac 
1621 tgagttcctt cagatcagcc acaccttttc caggagttct gaagctgaca gcactacaaa 
1681 ggagacgggg gagcagcacg attttgcaca aatatccacc actttaacct tagagcttgg 
1741 acagtctgag ctgtaggtaa ccggcatatt attccatatc tttgttttaa ccagtacttc 
1801 taagagcata gaactcaaat gctgggggag gtggctaatc tcaggactgg gaag 

FIG.8B 
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1 MTMVDTEIAFWPTNFGISSVDLSVMEDHSHSFDIKPFTTVDFSSISTPHYEDIPFTRTDP 
61 VVADYKYDLKLQEYQSAI KVEPASPPYYSEKTQL YNKPHEEPSNSLMAI ECRVCGDKASG 
121 FHYGVHACEGCKGFFRRTIRLKLIYDRCOLNCRIHKKSRNKCQYCRFQKCLAVGMSHNAI 
181 RFGRIAQAEKEKLLAEISSDIDQLNPESADLRQALAKHLYDSYIKSFPLTKAKARAILTG 
241 KnOKSPFVIYDMNSLMMGEDKIKFKHITPLQEQSKEVAIRIFQGCQFRSVEAVQEITEY 
301 AK5IPGFVNLDLNDQVTLLKYGVHEIIYTMLASLMNKDGVLISEGQGFMTREFLKSLRKP 
361 FGDFMEPKFEFAVKFNALELDDSDLAIFIAV1ILSGDRP6LLNVKPIEDIQDNLLQALEL 
421 QLKLNHPESSQUAKLLQKMTDLRQI VTEHVQLLQV I KKTETDMSLHPLLQE I YKDL Y 

FIG.9A 



1 ccgaccttac cccaggcggc cttgacgttg gtcttgtcgg caggagacag caccatggtg 
61 ggttctctct gagtctggga attcccgagc ccgagccgca gccgccgcct ggggggcttg 
121 ggtcggcctc gaggacaccg gagaggggcg ccacgccgcc gtggccgcag aaatgaccat 
181 ggttgacaca gagatcgcat tctggcccac caactttggg atcagctccg tggatctctc 
241 cgtaatggaa gaccactccc actcctttga tatcaagccc ttcactactg ttgacttctc 
301 cagcatttct actccacatt acgaagacat tccattcaca agaacagatc cagtggttgc 
361 agattacaag tatgacctga aacttcaaga gtaccaaagt gcaatcaaag tggagcctgc 
421 atctccacct tattattctg agaagactca gctctacaat aagcctcatg aagagccttc 
481 caactccctc atggcaattg aatgtcgtgt ctgtggagat aaagcttctg gatttcacta 
541 tggagttcat gcttgtgaag gatgcaaggg tttcttccgg agaacaatca gattgaagct 
601 tatctatgac agatgtgatc ttaactgtcg gatccacaaa aaaagtagaa ataaatgtca 
661 gtactgtcgg tttcagaaat gccttgcagt ggggatgtct cataatgcca tcaggtttgg 
721 gcggatcgca caggccgaga aggagaagct gttggcggag atctccagtg atatcgacca 
781 gctgaatcca gagtccgctg acctccgtca ggccctggca aaacatttgt atgactcata 
841 cataaagtcc ttcccgctga ccaaagcaaa ggcgagggcg atcttgacag gaaagacaac 
901 agacaaatca ccattcgtta tctatgacat gaattcctta atgatgggag aagataaaat 
961 caagttcaaa cacatcaccc ccctgcagga gcagagcaaa gaggtggcca tccgcatctt 
1021 tcagggctgc cagtttcgct ccgtggaggc tgtgcaggag atcacagagt atgccaaaag 
1081 cattcctggt tttgtaaatc ttgacttgaa cgaccaagta actctcctca aatatggagt 
1141 ccacgagatc atttacacaa tgctggcctc cttgatgaat aaagatgggg ttctcatatc 
1201 cgagggccaa ggcttcatga caagggagtt tctaaagagc ctgcgaaagc cttttggtga 
1261 ctttatggag cccaagtttg agtttgctgt gaagttcaat gcactggaat tagatgacag 
1321 cgacttggca atatttattg ctgtcattat tctcagtgga gaccgcccag gtttgctgaa 
1381 tgtgaagccc attgaagaca ttcaagacaa cctgctacaa gccctggagc tccagctgaa 
1441 gctgaaccac cctgagtcct cacagctgtt tgccaagctg ctccagaaaa tgacagacct 
1501 cagacagatt gtcacggaac acgtgcagct actgcaggtg atcaagaaga cggagacaga 
1561 catgagtctt cacccgctcc tgcaggagat ctacaaggac ttgtactagc agagagtcct 
1621 gagccactgc caacatttcc cttcttccag ttgcactatt ctgagggaaa atctgaccat 
1681 aagaaattta ctgtgaaaaa gcgttttaaa aagaaaaggg tttagaatat gatctatttt 
1741 atgcatattg tttataaaga cacatttaca atttactttt aatattaaaa attaccatat 
1801 tatgaaattg c 

F1G.9B 
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1 ME0PQEEAPEVREEEEKEEVAEAE6APELNGGPQHALPSSSYTDLSRSSSPPSLLDQLQM 

61 GCPGASCGSLNMECRVCGDKAS6FHYGVHACE6CKGFFRRTIRMKLEYEKCERSCKIQKK 

121 NRNKCQYCRFQKCLALGMSHNAIRFGRMPEAEKRKLVAGLTANEGSQYNPOVADLKAFSK 

181 HIYNAYLKNFNMTKKKARSILTGKASHTAPFVIHDIETLWQAEKGLVWKQLVNGLPPYKE 

241 ISVHVFYRCQCnVETVRELTEFAKSIPSFSSLFLNDQVTLLKYGVHEAIFAMLASIVNK 

301 DGLLVANGS6FVTREFLRSLRKPFSDIIEPKFEFAVKFNALELDDSDLALFI AAIILCGD 

361 RPGLMNVPRVEAIQDTILRALEFHLQANHPDAQYLFPKLLQKMADLRQLVTEHAQMMQRI 

421 KKTETETSLHPLLQEIYKDMY 



FIG.10A 
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1 gaattctgcg gagcctgcgg gacggcggcg ggttggcccg taggcagccg ggacagtgtt 
61 gtacagtgtt ttgggcatgc acgtgatact cacacagtgg cttctgctca ccaacagatg 
121 aagacagatg caccaacgag ggtctggaat ggtctggagt ggtctggaaa gcagggtcag 
181 atacccctgg aaaactgaag cccgtggagc aatgatctct acaggactgc ttcaaggctg 
241 atgggaacca ccctgtagag gtccatctgc gttcagaccc agacgatgcc agagctatga 
301 ctgggcctgc aggtgtggcg ccgaggggag atcagccatg gagcagccac aggaggaagc 
361 ccctgaggtc cgggaagagg aggagaaaga ggaagtggca gaggcagaag gagccccaga 
421 gctcaatggg ggaccacagc atgcacttcc ttccagcagc tacacagacc tctcccggag 
481 ctcctcgcca ccctcactgc tggaccaact gcagatgggc tgtgacgggg cctcatgcgg 
541 cagcctcaac atggagtgcc gggtgtgcgg ggacaaggca tcgggcttcc actacggtgt 
601 tcatgcatgt gaggggtgca agggcttctt ccgtcgtacg atccgcatga agctggagta 
661 cgagaagtgt gagcgcagct gcaagattca gaagaagaac cgcaacaagt gccagtactg 
721 ccgcttccag aagtgcctgg cactgggcat gtcacacaac gctatccgtt ttggtcggat 
781 gccggaggct gagaagagga agctggtggc agggctgact gcaaacgagg ggagccagta 
841 caacccacag gtggccgacc tgaaggcctt ctccaagcac atctacaatg cctacctgaa 
901 aaacttcaac atgaccaaaa agaaggcccg cagcatcctc accggcaaag ccagccacac 
961 ggcgcccttt gtgatccacg acatcgagac attgtggcag gcagagaagg ggctggtgtg 
1021 gaagcagttg gtgaatggcc tgcctcccta caaggagatc agcgtgcacg tcttctaccg 
1081 ctgccagtgc accacagtgg agaccgtgcg ggagctcact gagttcgcca agagcatccc 
1141 cagcttcagc agcctcttcc tcaacgacca ggttaccctt ctcaagtatg gcgtgcacga 
1201 ggccatcttc gccatgctgg cctctatcgt caacaaggac gggctgctgg tagccaacgg 
1261 cagtggcttt gtcacccgtg agttcctgcg cagcctccgc aaacccttca gtgatatcat 
1321 tgagcctaag tttgaatttg ctgtcaagtt caacgccctg gaacttgatg acagtgacct 
1381 ggccctattc attgcggcca tcattctgtg tggagaccgg ccaggcctca tgaacgttcc 
1441 acgggtggag gctatccagg acaccatcct gcgtgccctc gaattccacc tgcaggccaa 
1501 ccaccctgat gcccagtacc tcttccccaa gctgctgcag aagatggctg acctgcggca 
1561 actggtcacc gagcacgccc agatgatgca gcggatcaag aagaccgaaa ccpagacctc 
1621 gctgcaccct ctgctccagg agatctacaa ggacatgtac taacggcggc acccaggcct 
1681 ccctgcagac tccaatgggg ccagcactgg aggggcccac ccacatgact tttccattga 
1741 ccagctctct tcctgtcttt gttgtctccc tctttctcag ttcctctttc ttttctaatt 
1801 cctgttgctc tgtttcttcc tttctgtagg tttctctctt cccttctccc ttctcccttg 
1861 ccctcccttt ctctctccta tccccacgtc tgtcctcctt tcttattctg tgagatgttt 
1921 tgtattattt caccagcagc atagaacagg acctctgctt ttgcacacct tttccccagg 
1981 agcagaagag agtgggcctg ccctctgccc catcattgca cctgcaggct taggtcctca 
2041 cttctgtctc ctgtcttcag agcaaaagac ttgagccatc caaagaaaca ctaagctctc 
2101 tgggcctggg ttccagggaa ggctaagcat ggcctggact gactgcagcc ccctatagtc 
2161 atggggtccc tgctgcaaag gacagtggca gaccccggca gtagagccga gatgcctccc 
2221 caagactgtc attgcccctc cgatcgtgag gccacccact gacccaatga tcctctccag 
2281 cagcacacct cagccccact gacacccagt gtccttccat cttcacactg gtttgccagg 
2341 ccaatgttgc tgatggcccc tccagcacac acacataagc actgaaatca ctttacctgc 
2401 aggcaccatg cacctccctt ccctccctga ggcaggtgag aacccagaga gaggggcctg 

FIG.10B 
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2461 caggtgagca ggcagggctg ggccaggtct ccggggaggc aggggtcctg caggtcctgg 
2521 tgggtcagcc cagcacctcg cccagtggga gcttcccggg ataaactgag cctgttcatt 
2581 ctgatgtcca tttgtcccaa tagctctact gccctcccct tcccctttac tcagcccagc 
2641 tggccaccta gaagtctccc tgcacagcct ctagtgtccg gggaccttgt gggaccagtc 
2701 ccacaccgct ggtccctgcc ctcccctgct cccaggttga ggtgcgctca cctcagagca 
2761 gggccaaagc acagctgggc atgccatgtc tgagcggcgc agagccctcc aggcctgcag 
2821 gggcaagggg ctggctggag tctcagagca cagaggtagg agaactgggg ttcaagccca 
2881 ggcttcctgg gtcctgcctg gtcctccctc ccaaggagcc attctatgtg actctgggtg 
2941 gaagtgccca gcccctgcct gacggnnnnn nngatcactc tctgctggca ggattcttcc 
3001 cgctccccac ctacccagct gatgggggtt ggggtgcttc tttcagccaa ggctatgaag 
3061 ggacagctgc tgggacccac ctcccccctt ccccggccac atgccgcgtc cctgccccca 
3121 cccgggtctg gtgctgagga tacagctctt ctcagtgtct gaacaatctc caaaattgaa 
3181 atgtatattt ttgctaggag ccccagcttc ctgtgttttt aatataaata gtgtacacag 
3241 actgacgaaa ctttaaataa atgggaatta aatatttaaa aaaaaaagcg gccgcgaatt 

3301 c 



FIG. IOC 



8NS0OC1D: <WO_e918124A1J.> 



wo 99/1 8124 PCTAJS98/2 1 049 



SEQUENCE LISTING . 
(1) GENERAL INFORMATION: 
(i) APPLICANT: Merck t Co., Inc. 

(ii) TITLE OF INVENTION: ASSAYS FOR NUCLEAR RECEPTOR 

AGONISTS AND ANTAGONISTS USING FLUORESCENCE RESONANCE 
ENERGY TRANSFER 

(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merck & Co., Inc. 

(B) STREET: P.O. Box 2000, 126 E. Lincoln Ave. 

(C) CITY: Rahway 

(D) STATE: NJ 

(E) COUNTRY: USA 

(F) ZIP: 07065-0900 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 
{A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Coppola, Joseph A 

(B) REGISTRATION NUMBER: 38,413 

(C) REFERENCE/DOCKET NUMBER: 20017PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 732-594-6734 

(B) TELEFAX: 732-594-4720 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1; 

Met Ala Glu Asn Leu Leu Asp Gly Pro Pro Asn Pro Lys Arg Ala Lys 

15 10 15 

Leu Ser Ser Pro Gly Phe Ser Ala Asn Asp Ser Thr Asp Phe Gly Ser 

20 25 30 

Leu Phe Asp Leu Glu Asn Asp Leu Pro Asp Glu Leu lie Pro Asn Gly 

35 40 45 

Gly Glu Leu Gly Leu Leu Asn Ser Gly Asn Leu Val Pro Asp Ala Ala 

50 55 60 

Ser Lys His Lys Gin Leu Ser Glu Leu Leu Arg Gly Gly Ser Gly Ser 
65 70 75 80 

Ser lie Asn Pro Gly lie Gly Asn Val Ser Ala Ser Ser Pro Val Gin 

85 .90 95 

Gin Gly Leu Gly Gly Gin Ala Gin Gly Gin Pro Asn Ser Ala Asn Met 

100 105 110 

Ala Ser Leu Ser Ala Met Gly Lys Ser Pro Leu Ser Gin Gly Asp Ser 

115 120 125 

Ser Ala Pro Ser Leu Pro Lys Gin Ala Ala Ser Thr Ser Gly Pro Thr 

130 135 140 

Pro Ala Ala Ser Gin Ala Leu Asn Pro Gin Ala Gin Lys Gin Val Gly 
145 150 155 160 

Leu Ala Thr Ser Ser Pro Ala Thr Ser Gin Thr Gly Pro Gly He Cys 

165 170 175 

Met Asn Ala Asn Phe Asn Gin Thr His Pro Gly Leu Leu Asn Ser Asn 

ISO 185 190 

Ser Gly His Ser Leu He Asn Gin Ala Ser Gin Gly Gin Ala Gin Val 

195 200 205 

Met Asn Gly Ser Leu Gly Ala Ala Gly Arg Gly Arg Gly Ala Gly Met 

210 215 220 

Pro Tyr Pro Thr Pro Ala Met Gin Gly Ala Ser Ser Ser Val Leu Ala 
225 230 235 240 

Glu Thr Leu Thr Gin Val Ser Pro Gin Met Thr Gly His Ala Gly Leu 

245 250 255 

Asn Thr Ala Gin Ala Gly Gly Met Ala Lys Met Gly He Thr Gly Asn 

260 265 270 

Thr Ser Pro Phe Gly Gin Pro Phe Ser Gin Ala Gly Gly Gin Pro Met 

275 280 285 

Gly Ala Thr Gly Val Asn Pro Gin Leu Ala Ser Lys Gin Ser Met Val 

290 295 300 

Asn Ser Leu Pro Thr Phe Pro Thr Asp He Lys Asn Thr Ser Val Thr 
305 310 315 320 

Asn Val Pro Asn Met Ser Gin Met Gin Thr Ser Val Gly He Val Pro 

325 330 335 

Thr Gin Ala He Ala Thr Gly Pro Thr Ala Asp Pro Glu Lys Arg Lys 

340 345 350 

Leu He Gin Gin Gin Leu Val Leu Leu Leu His Ala His Lys Cys Gin 

355 360 365 

Arg Arg Glu Gin Ala Asn Gly Glu Val Arg Ala Cys Ser Leu Pro His 

370 375 380 

Cys Arg Thr Met Lys Asn Val Leu Asn His Met Thr His Cys Gin Ala 
385 390 395 400 

Gly Lys Ala Cys Gin 

405 
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(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STEUNDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



CGAGCCCCGA 
CCCGAGCAGG 
AAACTCAGCT 
TTGGAAAATG 
AGTGGGAACC 
GGAGGCAGCG 
CAGCAGGGCC 
AGTGCCATGG 
CAGGCAGCCA 
CAAAAGCAAG 
TGCATGAATG 
AGCTTAATTA 
GCTGGCAGAG 
AGCAGCGTGC 
CTGAACACCG 
TTTGGACAGC 
CAGTTAGCCA 
AATACTTCAG 
CCCACACAAG 
CAGCAGCTGG 
GAGGTTCGGG 
ACGCATTGTC 



CCCCCGTCCG 

TGAAAATGGC 

CGCCCGGTTT 

ATCTTCCTGA 

TTGTTCCAGA 

GCTCTAGTAT 

TGGGTGGCCA 

GCAAGAGCCC 

GCACCTCTGG 

TGGGGCTGGC 

CTAACTTTAA 

ATCAGGCTTC 

GAAGGGGAGC 

TGGCTGAGAC 

CACAGGCAGG 

CCTTTAGTCA 

GCAAACAGAG 

TCACCAACGT 

CAATTGCAAC 

TTCTACTGCT 

CCTGCTCGCT 

AGGCTGGGAA 



GGCCCTCGCC 

TGAGAACTTG 

CTCGGCGAAT 

TGAGCTGATA 

TGCTGCTTCC 

CAACCCAGGA 

GGCTCAAGGG 

TCTGAGCCAG 

GCCCACCCCC 

GACTAGCAGC 

CCAGACCCAC 

ACAAGGGCAG 

TGGAATGCCG 

CCTAACGCAG 

AGGCATGGCC 

AGCTGGAGGG 

CATGGTCAAC 

GCCAAATATG 

AGGCCCCACT 

TCATGCTCAT 

CCCGCATTGT 

AGCCTGCCAA 



GGCCGCGCCG 

CTGGACGGAC 

GACAGCACAG 

CCCAATGGAG 

AAACATAAAC 

ATAGGAAATG 

CAGCCGAACA 

GGAGATTCTT 

GCTGCCTCCC 

CCTGCCACGT 

CCAGGCCTCC 

GCGCAAGTCA 

TACCCTACTC 

GTTTCCCCGC 

AAGATGGGAA 

CAGCCAATGG 

AGTTTGCCCA 

TCTCAGATGC 

GCAGATCCTG 

AAGTGTCAGA 

CGAACCATGA 



CCCGTGCCCG 

CGCCCAACCC 

ATTTTGGATC 

GAGAATTAGG 

AACTGTCGGA 

TGAGCGCCAG 

GTGCTAACAT 

CAGCCCCCAG 

AAGCACTGAA 

CACAGACTGG 

TCAATAGTAA 

TGAATGGATC 

CAGCCATGCA 

AAATGACTGG 

TAACTGGGAA 

GAGCCACTGG 

CCTTCCCTAC 

AAACATCAGT 

AAAAACGCAA 

GACGAGAGCA 

AAAACGTTTT 



GGGCTGTTTT 60 

CAAAAGAGCC 120 

ATTGTTTGAC 180 

CCTTTTAAAC 240 

GCTTCTACGA 300 

CAGCCCCGTG 360 

GGCCAGCCTC 420 

CCTGCCTAAA 480 

TCCGCAAGCA 540 

ACCTGGTATC 600 

CTCTGGCCAT 660 

TCTTGGGGCT 720 

GGGCGCCTCG 780 

TCACGCGGGA 840 

CACAAGTCCA 900 

AGTGAACCCC 960 

AGATATCAAG 1020 

GGGAATTGTA 1080 

ACTGATACAG 1140 

AGCAAACGGA 1200 

GAATCACATG 1260 

1290 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOIjECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Val Asp Thr Glu Ser Pro Leu Cys Pro Leu Ser Pro Leu Glu Ala 

15 10 15 

Gly Asp Leu Glu Ser Pro Leu Ser Glu Glu Phe Leu Gin Glu Met Gly 

20 25 30 

Asn lie Gin Glu lie Ser Gin Ser lie Gly Glu Asp Ser Ser Gly Ser 

35 40 45 

Phe Gly Phe Thr Glu Tyr Gin Tyr Leu Gly Ser Cys Pro Gly Ser Asp 
50 55 60 
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Gly 


Ser 


Val 


He 


Thr 


Asp 


Thr 


Leu 


Ser 


Pro 


Ala 


Ser 


Ser 


Pro 


Ser 


Ser 


65 










70 










75 










80 


Val 


Thr 


Tyr 


Pro 


Val 


Val 


Pro 


Gly 


Ser 


Val 


Asp 


Glu 


Ser 


Pro 


Ser 


Gly 










85 










90 










95 


Ala 


Leu 


Asn 


He 


Glu 


Cys 


Arg 


He 


Cys 


Gly 


Asp 


Lys 


Ala 


Ser 


Gly 


Tyr 








100 










105 










110 


His 


Tyr 


Gly 


Val 


His 


Ala 


Cys 


Glu 


Gly 


Cys 


Lys 


Gly 


Phe 


Phe 


Arg 


Arg 






115 










120 










125 




Thr 


He 


Arg 


Leu 


Lys 


Leu 


Val 


Tyr 


Asp 


Lys 


Cys 


Asp 


Arg 


Ser 


Cys 


Lys 




130 










135 










140 






lie 


Gin 


Lys 


Lys 


Asn 


Arg 


Asn 


Lys 


Cys 


Gin 


Tyr 


Cys 


Arg 


Phe 


His 


Lys 


145 










150 










155 










160 


Cys 


Leu 


Ser 


Val 


Gly 
165 


Met 


Ser 


His 


Asn 


Ala 
170 


He 


Arg 


Phe 


Gly 


Arg 
175 


Met 


Pro 


Arg 


Ser 


Glu 


Lys 


Ala 


Lys 


Leu 


Lys 


Ala 


Glu 


He 


Leu 


Thr 


Cys 


Glu 








180 










185 










190 




His 


Asp 


He 


Glu 


Asp 


Ser 


Glu 


Thr 


Ala 


Asp 


Leu 


Lys 


Ser 


Leu 


Ala 


Lys 






195 










200 








205 






Arg 


He 


Tyr 


Glu 


Ala 


Tyr 


Leu 


Lys 


Asn 


Phe 


Asn 


Met 


Asn 


Lys 


Val 


Lys 




210 










215 










220 






Ala 


Arg 


Val 


He 


Leu 


Ser 


Gly 


Lys 


Ala 


Ser 


Asn 


Asn 


Pro 


Pro 


Phe 


Val 


225 










230 










235 










240 


He 


His 


Asp 


Met 


Glu 
245 


Thr 


Leu 


Cys 


Met 


Ala 
250 


Glu 


Lys 


Thr 


Leu 


Val 
255 


Ala 


Lys 


Leu 


Val 


Ala 


Asn 


Gly 


He 


Gin 


Asn 


Lys 


Glu 


Val 


Glu 


Val 


Arg 


He 








260 










265 










270 




Phe 


His 


Cys 
275 


Cys 


Gin 


Cys 


Thr 


Ser 
280 


Val 


Glu 


Thr 


Val 


Thr 
285 


Glu 


Leu 


Thr 


Glu 


Phe 


Ala 


Lys 


Ala 


He 


Pro 


Ala 


Phe 


Ala 


Asn 


Leu 


Asp 


Leu 


Asn 


Asp 




290 










295 










300 






Gin 


Val 


Thr 


Leu 


Leu 


Lys 


Tyr Gly 


Val 


Tyr 


Glu 


Ala 


He 


Phe 


Ala 


Met 


305 










310 










315 










320 


Leu 


Ser 


Ser 


Val 


Met 


Asn 


Lys 


Asp 


Gly Met 


Leu 


Val 


Ala 


Tyr 


Gly 


Asn 










325 










330 










335 




Gly 


Phe 


He 


Thr 


Arg 


Glu 


Phe 


Leu 


Lys 


Ser 


Leu 


Arg 


Lys 


Pro 


Phe 


Cys 








340 










345 










350 




Asp 


He 


Met 

355 


Glu 


Pro 


Lys 


Phe 


Asp 
360 


Phe 


Ala 


Met 


Lys 


Phe 
365 


Asn 


Ala 


Leu 


Glu 


Leu 


Asp 


Asp 


Ser 


Asp 


He 


Ser 


Leu 


Phe 


Val 


Ala 


Ala 


He 


He 


Cys 




370 










375 










380 








Cys 


Gly 


Asp 


Arg 


Pro 


Gly 


Leu 


Leu 


Asn 


Val 


Gly 


His 


He 


Glu 


Lys 


Met 


385 










390 










395 








400 


Gin 


Glu 


Gly 


He 


Val 


His 


Val 


Leu 


Arg Leu 


His 


Leu 


Gin 


Ser 


Asn 


His 










405 










410 










415 




Pro 


Asp 


Asp 


He 


Phe 


Leu 


Phe 


Pro 


Lys 


Leu 


Leu 


Gin 


Lys 


Met 


Ala 


Asp 








420 










425 










430 




Leu 


Arg 


Gin 


Leu 


Val 


Thr 


Glu 


His 


Ala 


Gin 


Leu 


Val 


Gin 


He 


He 


Lys 






435 










440 










445 






Lys 


Thr 


Glu 


Ser 


Asp 


Ala 


Ala 


Leu 


His 


Pro 


Leu 


Leu 


Gin 


Glu 


He 


Tyr 



450 455 460 



Arg Asp Met Tyr 
465 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1854 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGCCCAGGCT GAAGCTCAGG GCCCTGTCTG CTCTGTGGAC TCAACAGTTT GTGGCAAGAC 60 

AAGCTCAGAA CTGAGAAGCT GTCACCACAG TTCTGGAGGC TGGGAAGTTC AAGATCAAAG 120 

TGCCAGCAGA TTCAGTGTCA TGTGAGGACG TGCTTCCTGC TTCATAGATA AGAGTAGCTT 180 

GGAGCTCGGC GGCACAACCA GCACCATCTG GTCGCGATGG TGGACACGGA AAGCCCACTC 240 

TGCCCCCTCT CCCCACTCGA GGCCGGCGAT CTAGAGAGCC CGTTATCTGA AGAGTTCCTG 300 

CAAGAAATGG GAAACATCCA AGAGATTTCG CAATCCATCG GCGAGGATAG TTCTGGAAGC 360 

TTTGGCTTTA CGGAATACCA GTATTTAGGA AGCTGTCCTG GCTCAGATGG CTCGGTCATC 420 

ACGGACACGC TTTCACCAGC TTCGAGCCCC TCCTCGGTGA CTTATCCTGT GGTCCCCGGC 480 

AGCGTGGACG AGTCTCCCAG TGGAGCATTG AACATCGAAT GTAGAATCTG CGGGGACAAG 540 

GCCTCAGGCT ATCATTACGG AGTCCACGCG TGTGAAGGCT GCAAGGGCTT CTTTCGGCGA 600 

ACGATTCGAC TCAAGCTGGT GTATGACAAG TGCGACCGCA GCTGCAAGAT CCAGAAAAAG 660 

AACAGAAACA AATGCCAGTA TTGTCGATTT CACAAGTGCC TTTCTGTCGG GATGTCACAC 720 

AACGCGATTC GTTTTGGACG AATGCCAAGA TCTGAGAAAG CAAAACTGAA AGCAGAAATT 780 

CTTACCTCTG AACATGACAT AGAAGATTCT GAAACTGCAG ATCTCAAATC TCTGGCCAAG 840 

AGAATCTACG AGGCCTACTT GAAGAACTTC AACATGAACA AGGTCAAAGC CCGGGTCATC 900 

CTCTCAGGAA AGGCCAGTAA CAATCCACCT TTTGTCATAC ATGATATGGA GACACTGTGT 960 

ATGGCTGAGA AGACGCTGGT GGCCAAGCTG GTGGCCAATG GCATCCAGAA CAAGGAGGTG 1020 

GAGGTCCGCA TCTTTCACTG CT6CCAGTGC ACGTCAGTGG AGACCGTCAC GGAGCTCACG 1080 

GAATTCGCCA AGGCCATCCC AGCGTTCGCA AACTTGGACC TGAACGATCA AGTGACATTG 1140 

CTAAAATACG GAGTTTATGA GGCCATATTC GCCATGCTGT CTTCTGTGAT GAACAAAGAC 1200 

GGGATGCTCG TAGCGTATGG AAATGGGTTT ATAACTCGTG AATTCCTAAA AAGCCTAAGG 1260 

AAACCGTTCT GTGATATCAT GGAACCCAAG TTTGATTTTG CCATGAAGTT CAATGCACTG 1320 

GAACTGGATG ACAGTGATAT CTCCCTTTTT GTGGCTGCTA TCATTTGCTG TGGAGATCGT 1380 

CCTGGCCTTC TAAACGTAGG ACACATTGAA AAAATGCAGG AGGGTATTGT ACATGTGCTC 1440 

AGACTCCACC TGCAGAGCAA CCACCCGGAC GATATCTTTC TCTTCCCAAA ACTTCTTCAA 1500 

AAAATGGCAG ACCTCCGGCA GCTGGTGACG GA6CATGCGC AGCTGGTGCA GATCATCAAG 1560 

AAGACGGAGT CGGATGCTGC GCTGCACCCG CTACTGCAGG AGATCTACAG GGACATGTAC 1620 

TGAGTTCCTT CAGATCAGCC ACACCTTTTC CAGGAGTTCT 6AAGCTGACA GCACTACAAA 1680 

GGAGACGGGG GAGCAGCACG ATTTTGCACA AATATCCACC ACTTTAACCT TAGAGCTTGG 1740 

ACAGTCTGAG CTGTAGGTAA CCGGCATATT ATTCCATATC TTTGTTTTAA CCTiGTACTTC 1800 

TAAGAGCATA GAACTCAAAT GCTGGGGGAG GTGGCTAATC TCAGGACTGG GAAG 1854 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Thr Met Val Asp Thr Glu He Ala Phe Trp Pro Thr Asn Phe Gly 
15 10 15 
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He 


Ser 


Ser 


Val 


Asp 


Leu 


Ser 


Val 


Met 








20 










25 


Asp 


He 


Lys 


Pro 


Phe 


Thr 


Thr 


Val 


Asp 






35 










40 




His 


Tyr 


Glu 


Asp 


He 


Pro 


Phe 


Thr 


Arg 




50 










55 






Tyr 


Lys 


Tyr 


Asp 


Leu 


Lys 


Leu 


Gin 


Glu 


65 










70 








Glu 


Pro 


Ala 


Ser 


Pro 


Pro 


Tyr Tyr 


Ser 










85 










Lys 


Pro 


His 


Glu 


Glu 


Pro 


Ser 


Asn 


Ser 








100 










105 


Val 


Cys 


Gly 


Asp 


Lys Ala Ser Gly 


Phe 






115 










120 




Glu 


Gly 


Cys 


Lys 


Gly 


Phe 


Phe 


Arg 


Arg 




130 










135 






Tyr 


Asp 


Arg 


Cys 


Asp Leu 


Asn 


Cys 


Arg 


145 










150 








Lys 


Cys 


Gin 


Tyr 


Cys 


Arg 


Phe 


Gin 


Lys 










165 










His 


Asn 


Ala 


He 


Arg 


Phe Gly 


Arg 


He 








180 










185 


Leu 


Leu 


Ala 


Glu 


He 


Ser 


Ser 


Asp 


He 






195 










200 




Ala 


Asp Leu Arg 


Gin 


Ala 


Leu 


Ala 


Lys 




210 










215 






Lys 


Ser 


Phe 


Pro 


Leu 


Thr 


Lys 


Ala 


Lys 


225 










230 








Lys 


Thr 


Thr 


Asp 


Lys 


Ser 


Pro 


Phe 


Val 










245 










Met 


Met Gly Glu 


Asp 


Lys 


He 


Lys 


Phe 








260 










265 


Glu 


Gin 


Ser 


Lys 


Glu 


Val 


Ala 


He 


Arg 






275 










280 




Arg 


Ser 


Val 


Glu 


Ala 


Val 


Gin 


Glu 


He 




290 










295 






Pro 


Gly 


Phe 


Val 


Asn 


Leu Asp Leu 


Asn 


305 










310 








Tyr 


Gly Val His 


Glu 


He 


He 


Tyr 


Thr 










325 










Lys 


Asp Gly Val 


Leu 


He 


Ser Glu Gly 








340 










345 


Phe 


Leu 


Lys 


Ser 


Leu Arg Lys 


Pro 


Phe 






355 










360 




Phe 


Glu 


Phe 


Ala 


Val 


Lys 


Phe 


Asn 


Ala 




370 










375 






Leu 


Ala 


He 


Phe 


He 


Ala 


Val 


He 


He 


385 










390 








Leu 


Leu 


Asn 


Val 


Lys 


Pro 


He Glu Asp 










405 










Ala 


Leu 


Glu 


Leu 


Gin 


Leu 


Lys 


Leu 


Asn 








420 










425 


Phe 


Ala 


Lys 


Leu 


Leu Gin Lys Met Thr 






435 










440 




Glu 


His 


Val 


Gin 


Leu 


Leu Gin 


Val 


He 




450 










455 
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Glu 


Asp 


His 


Ser 


His 


Ser 


Phe 










30 






Phe 


Ser 


Ser 


He 


Ser 


Thr 


Pro 








45 








Thr Asp 


Pro 


Val 


Val 


Ala 


Asp 






60 










Tyr Gin 


Ser 


Ala 


He 


Lys 


Val 




75 








80 


Glu 


Lys 


Thr 


Gin 


Leu 


Tyr 


Asn 


90 










95 




Leu 


Met 


Ala 


He 


Glu 


Cys 


Arg 










110 






His 


Tyr 


Gly 


Val 


His 


Ala 


Cys 








125 








Thr 


He 


Arg 


Leu 


Lys 


Leu 


He 






140 










He 


His 


Lys 


Lys 


Ser 


Arg 


Asn 




155 










160 


Cys 


Leu 


Ala 


Val 


Gly 


Met 


Ser 


170 










175 




Ala 


Gin 


Ala 


Glu 


Lys 


Glu 


Lys 










190 






Asp Gin 


Leu 


Asn 


Pro 


Glu 


Ser 








205 








His 


Leu 


Tyr 


Asp 


Ser 


Tyr 


He 






220 










Ala Arg 


Ala 


He 


Leu 


Thr 


Gly 




235 










240 


He 


Tyr 


Asp 


Met 


Asn 


Ser 


Leu 


250 










255 




Lys 


His 


He 


Thr 


Pro 


Leu 


Gin 










270 






He 


Phe 


Gin 


Gly 


Cys 


Gin 


Phe 








285 








Thr 


Glu 


Tyr 


Ala 


Lys 


Ser 


He 






300 










Asp Gin 


Val 


Thr 


Leu 


Leu 


Lys 




315 










320 


Met 


Leu 


Ala 


Ser 


Leu 


Met 


Asn 


330 










335 




Gin Gly 


Phe 


Met 


Thr 


Arg 


Glu 










350 






Gly Asp 


Phe 


Met 


Glu 


Pro 


Lys 








365 








Leu 


Glu 


Leu 


Asp 


Asp 


Ser 


Asp 






380 










Leu 


Ser 


Gly 


Asp 


Arg 


Pro 


Gly 




395 










400 


He 


Gin 


Asp 


Asn 


Leu 


Leu 


Gin 


410 










415 




His 


Pro 


Glu 


Ser 


Ser 


Gin 


Leu 










430 






Asp Leu 


Arg 


Gin 


He 


Val 


Thr 








445 








Lys 


Lys 


Thr 


Glu 


Ihr 


Asp 


Met 



460 
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Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys Asp Leu Tyr 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCGACCTTAC CCCAGGCGGC CTTGACGTTG GTCTTGTCGG CAGGAGACAG CACCATGGTG 60 

GGTTCTCTCT GAGTCTGGGA ATTCCCGAGC CCGAGCCGCA GCCGCCGCCT GGGGGGCTTG 120 

G6TCGGCCTC GAGGACACCG GAGAGGGGCG CCACGCCGCC GTGGCCGCAG AAATGACCAT 180 

GGTTGACACA GAGATCGCAT TCTGGCCCAC CAACTTTGGG ATCAGCTCCG TGGATCTCTC 240 

CGTAATGGAA GACCACTCCC ACTCCTTTGA TATCAAGCCC TTCACTACTG TTGACTTCTC 300 

CAGCATTTCT ACTCCACATT ACGAAGACAT TCCATTCACA AGAACAGATC CAGTGGTTGC 360 

AGATTACAAG TATGACCTGA AACTTCAAGA GTACCAAAGT GCAATCAAAG TGGAGCCTGC 420 

ATCTCCACCT TATTATTCTG AGAAGACTCA GCTCTACAAT AAGCCTCATG AAGAGCCTTC 480 

CAACTCCCTC ATGGCAATTG AATGTCGTGT CTGTGGAGAT AAAGCTTCTG GATTTCACTA 540 

TGGAGTTCAT GCTTGTGAAG GATGCAAGGG TTTCTTCCGG AGAACAATCA GATTGAAGCT 600 

TATCTATGAC AGATGTGATC TTAACTGTCG GATCCACAAA AAAAGTAGAA ATAAATGTCA 660 

GTACTGTCGG TTTCAGAAAT GCCTTGCAGT GGGGATGTCT CATAATGCCA TCAGGTTTGG 720 

GCGGATCGCA CAG6CCGAGA AGGAGAAGCT GTTGGCGGAG ATCTCCAGTG ATATCGACCA 780 

GCTGAATCCA GAGTCCGCTG ACCTCCGTCA GGCCCTGGCA AAACATTTGT ATGACTCATA 840 

CATAAAGTCC TTCCCGCTGA CCAAAGCAAA GGCGAGGGCG ATCTTGACAG GAAAGACAAC 900 

AGACAAATCA CCATTCGTTA TCTATGACAT GAATTCCTTA AT6ATGGGAG AAGATAAAAT 960 

CAAGTTCAAA CACATCACCC CCCTGCAGGA GCAGAGCAAA GAGGTGGCCA TCCGCATCTT 1020 

TCAGGGCTGC CAGTTTCGCT CCGTGGAGGC TGTGCAGGAG ATCACAGAGT ATGCCAAAAG 1080 

CATTCCTGGT TTTGTAAATC TTGACTTGAA CGACCAAGTA ACTCTCCTCA AATATGGAGT 1140 

CCACGAGATC ATTTACACAA TGCTGGCCTC CTTGATGAAT AAAGATGGGG TTCTCATATC 1200 

CGAGGGCCAA GGCTTCATGA CAAGGGAGTT TCTAAAGAGC CTGCGAAAGC CTTTTGGTGA 1260 

CTTTATGGAG CCCAAGTTTG AGTTTGCTGT GAAGTTCAAT GCACTGGAAT TAGATGACAG 1320 

CGACTTGGCA ATATTTATTG CTGTCATTAT TCTCAGTGGA GACCGCCCAG GTTTGCTGAA 1380 

TGTGAAGCCC ATTGAAGACA TTCAAGACAA CCTGCTACAA GCCCTGGAGC TCCAGCTGAA 1440 

GCTGAACCAC CCTGAGTCCT CACAGCTGTT TGCCAAGCTG CTCCAGAAAA TGACAGACCT 1500 

CAGACAGATT GTCACGGAAC ACGTGCAGCT ACTGCAGGTG ATCAAGAAGA CGGAGACAGA 1560 

CATGAGTCTT CACCCGCTCC TGCAGGAGAT CTACAAGGAC TTGTACTAGC AGAGAGTCCT 1620 

GAGCCACTGC CAACATTTCC CTTCTTCCAG TTGCACTATT CTGAGGGAAA ATCTGACCAT 1680 

AAGAAATTTA CTGTGAAAAA GCGTTTTAAA AAGAAAAGGG TTTAGAATAT GATCTATTTT 1740 

ATGCATATTG TTTATAAAGA CACATTTACA ATTTACTTTT AATATTAAAA ATTACCATAT 1800 

TATGAAATTG C 1811 

(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

Met Glu Gin Pro Gin Glu Glu Ala Pro Glu Val Arg Glu Glu Glu Glu 

15 10 15 

Lys Glu Glu Val Ala Glu Ala Glu Gly Ala Pro Glu Leu Asn Gly Gly 

20 25 30 

Pro Gin His Ala Leu Pro Ser Ser Ser Tyr Thr Asp Leu Ser Arg Ser 

35 40 45 

Ser Ser Pro Pro Ser Leu Leu Asp Gin Leu Gin Met Gly Cys Asp Gly 

50 55 60 

Ala Ser Cys Gly Ser Leu Asn Met Glu Cys Arg Val Cys Gly Asp Lys 
65 70 75 80 

Ala Ser Gly Phe His Tyr Gly Val His Ala Cys Glu Gly Cys Lys Gly 

85 90 95 

Phe Phe Arg Arg Thr lie Arg Met Lys Leu Glu Tyr Glu Lys Cys Glu 

100 105 110 

Arg Ser Cys Lys lie Gin Lys Lys Asn Arg Asn Lys Cys Gin Tyr Cys 

115 120 125 

Arg Phe Gin Lys Cys Leu Ala Leu Gly Met Ser His Asn Ala lie Arg 

130 135 140 

Phe Gly Arg Met Pro Glu Ala Glu Lys Arg Lys Leu Val Ala Gly Leu 
145 150 155 160 

Thr Ala Asn Glu Gly Ser Gin Tyr Asn Pro Gin Val Ala Asp Leu Lys 

165 170 175 

Ala Phe Ser Lys His lie Tyr Asn Ala Tyr Leu Lys Asn Phe Asn Met 

180 185 190 

Thr Lys Lys Lys Ala Arg Ser lie Leu Thr Gly Lys Ala Ser His Thr 

195 200 205 

Ala Pro Phe Val lie His Asp lie Glu Thr Leu Trp Gin Ala Glu Lys 

210 215 220 

Gly Leu Val Trp Lys Gin Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu 
225 230 235 240 

lie Ser Val His Val Phe Tyr Arg Cys Gin Cys Thr Thr Val Glu Thr 

245 250 255 

Val Arg Glu Leu Thr Glu Phe Ala Lys Ser lie Pro Ser Phe Ser Ser 

260 265 270 

Leu Phe Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly Val His Glu 

275 280 285 

Ala He Phe Ala Met Leu Ala Ser He Val Asn Lys Asp Gly Leu Leu 

290 295 300 

Val Ala Asn Gly Ser Gly Phe Val Thr Arg Glu Phe Leu Arg Ser Leu 
305 310 315 320 

Arg Lys Pro Phe Ser Asp He He Glu Pro Lys Phe Glu Phe Ala Val 

325 330 335 

Lys Phe Asn Ala Leu Qlu Leu Asp Asp Ser Asp Leu Ala Leu Phe He 

340 345 350 

Ala Ala He He Leu Cys Gly Asp Arg Pro Gly Leu Met Asn Val Pro 

355 360 365 

Arg Val Glu Ala He Gin Asp Thr He Leu Arg Ala Leu Glu Phe His 

370 375 380 

Leu Gin Ala Asn His Pro Asp Ala Gin Tyr Leu Phe Pro Lys Leu Leu 
385 390 395 400 

Gin Lys Met Ala Asp Leu Arg Gin Leu Val Thr Glu His Ala Gin Met 

405 410 415 
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Met Gin Arg lie Lys Lys Thr Glu Thr Glu Thr Ser Leu His Pro Leu 

420 425 430 

Leu Gin Glu lie Tyr Lys Asp Met Tyr 
435 440 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 

GAATTCTGCG GAGCCTGCGG GACGGCGGCG GGTTGGCCCG TAGGCAGCCG GGACAGTGTT 60 

GTACAGTGTT TTGGGCATGC ACGTGATACT CACACAGTGG CTTCTGCTCA CCAACAGATG 120 

AAQACAGATG CACCAACGAG GGTCTGGAAT GGTCTGGAGT GGTCTGGAAA GCAGGGTCAG 180 

ATACCCCTGG AAAACTGAAG CCCGTG6AGC AATGATCTCT ACAGGACTGC TTCAAGGCTG 240 

ATGGGAACCA CCCTGTAGAG GTCCATCTGC GTTCAGACCC AGACGATGCC AGAGCTATGA 300 

CTGGGCCTGC AGGTGTGGCG CCGAGGGCAG ATCAGCCATG GAGCAGCCAC AGGAGGAAGC 360 

CCCTGAGGTC CGGGAAGAGG AGGAGAAAGA GGAAGTGGCA GAGGCAGAAG GAGCCCCAGA 420 

GCTCAATGGG GGACCACAGC ATGCACTTCC TTCCAGCAGC TACACAGACC TCTCCCGGAG 480 

CTCCTCGCCA CCCTCACTGC TGGACCAACT GCAGATGGGC TGTGACGGGG CCTCATGCGG 540 

CAGCCTCAAC ATGGAGTGCC GGGTGTGCGG GGACAAGGCA TCGGGCTTCC ACTACGGTGT 600 

TCATGCATGT GAGGGGTGCA AGGGCTTCTT CCGTCGTACG ATCCGCATGA AGCTGGAGTA 660 

CGAGAAGTGT GAGCGCAGCT GCAAGATTCA GAAGAAGAAC CGCAACAAGT GCCAGTACTG 720 

CCGCTTCCAG AAGTGCCTGG CACTGGGCAT GTCACACAAC GCTATCCGTT TTGGTCGGAT 780 

GCCGGAGGCT GAGAAGAGGA AGCTGGTGGC AGGGCTGACT GCAAACGAGG GGAGCCAGTA 840 

CAACCCACAG GTGGCCGACC TGAAGGCCTT CTCCAAGCAC ATCTACAATG CCTACCTGAA 900 

AAACTTCAAC ATGACCAAAA AGAAGGCCCG CAGCATCCTC ACCGGCAAAG CCAGCCACAC 960 

GGCGCCCTTT GTGATCCACG ACATCGAGAC ATTGTGGCAG GCAGAGAAGG GGCTGGTGTG 1020 

GAAGCAGTTG GTGAATGGCC TGCCTCCCTA CAAGGAGATC AGCGTGCACG TCTTCTACCG 1080 

CTGCCAGTGC ACCACAGTGG AGACC6TGCG GGAGCTCACT GAGTTCGCCA AGAGCATCCC 1140 

CAGCTTCAGC AGCCTCTTCC TCAACGACCA GGTTACCCTT CTCAAGTATG GCGTGCACGA 1200 

GGCCATCTTC GCCATGCTGG CCTCTATCGT CAACAAGGAC GGGCTGCTGG TAGCCAACGG 1260 

CAGTGGCTTT GTCACCCGTG AGTTCCTGCG CAGCCTCCGC AAACCCTTCA 6TGATATCAT 1320 

TGAGCCTAAG TTTGAATTTG CTGTCAAGTT CAACGCCCTG GAACTTGATG ACAGTGACCT 1380 

GGCCCTATTC ATTGCGGCCA TCATTCTGTG TGGAGACCGG CCAGGCCTCA TGAACGTTCC 1440 

ACGGGTGGAG GCTATCCAGG ACACCATCCT GCGTGCCCTC GAATTCCACC TGCAGGCCAA 1500 

CCACCCTGAT GCCCAGTACC TCTTCCCCAA GCTGCTGCAG AAGATGGCTG ACCTGCGGCA 1560 

ACTGGTCACC GAGCACGCCC AGATGATGCA GCGGATCAAG AAGACCGAAA CCGAGACCTC 1620 

GCTGCACCCT CTGCTCCAGG AGATCTACAA GGACATGTAC TAACGGCGGC ACCCAGGCCT 1680 

CCCTGCAGAC TCCAATGGG6 CCAGCACTGG AGGGGCCCAC CCACATGACT TTTCCATTGA 1740 

CCAGCTCTCT TCCTGTCTTT GTTGTCTCCC TCTTTCTCAG TTCCTCTTTC TTTTCTAATT 1800 

CCTGTTGCTC TGTTTCTTCC TTTCTGTAGG TTTCTCTCTT CCCTTCTCCC TTCTCCCTTG 1860 

CCCTCCCTTT CTCTCTCCTA TCCCCACGTC TGTCCTCCTT TCTTATTCTG TGAGATGTTT 1920 

TGTATTATTT CACCAGCAGC ATAGAACAGG ACCTCTGCTT TTGCACACCT TTTCCCCAGG 1980 

AGCAGAAGAG AGTGGGCCTG CCCTCTGCCC CATCATTGCA CCTGCAGGCT TAGGTCCTCA 2040 

CTTCTGTCTC CTGTCTTCAG AGCAAAAGAC TTGAGCCATC CAAAGAAACA CTAAGCTCTC 2100 

TGGGCCTGGG TTCCAGGGAA GGCTAAGCAT GGCCTGGACT GACTGCAGCC CCCTATAGTC 2160 

ATGGGGTCCC TGCTGCAAAG GACAGTGGCA GACCCCGGCA GTAGAGCCGA GATGCCTCCC 2220 

CAAGACTGTC ATTGCCCCTC CGATCGTGAG GCCACCCACT GACCCAATGA TCCTCTCCAG 2280 

CAGCACACCT CAGCCCCACT GACACCCAGT GTCCTTCCAT CTTCACACTG 6TTTGCCAGG 2340 
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CCAATGTTGC TGATGGCCCC TCCAGCACAC ACACATAAGC ACTGAAATCA CTTTACCTGC 2400 

AGGCACCATG CACCTCCCTT CCCTCCCTGA GGCAGGTGAG AACCCAGAGA GAGGGGCCTG 2460 

CAGGTGAGCA GGCAGGGCTG GGCCAGGTCT CCGGGGAGGC AGGGGTCCTG CAGGTCCTGG 2520 

TGGGTCAGCC CAGCACCTCG CCCAGTGGGA GCTTCCCGGG ATAAACTGAG CCTGTTCATT 2580 

CTGATCTCCA TTTGTCCCAA TAGCTCTACT GCCCTCCCCT TCCCCTTTAC TCAGCCCAGC 2640 

TGGCCACCTA GAAGTCTCCC TGCACAGCCT CTAGTGTCCG GGGACCTTGT GGGACCAGTC 2700 

CCACACCGCT GGTCCCTGCC CTCCCCTGCT CCCAGGTTGA GGTGCGCTCA CCTCAGAGCA 2760 

GGGCCAAAGC ACAGCTGGGC ATGCCATGTC TGAGCGGCGC AGAGCCCTCC AGGCCTGCAG 2820 

GGGCAAGGGG CTGGCTGGAG TCTCAGAGCA CAGAGGTAGG AGAACTGGGG TTCAAGCCCA 2880 

GGCTTCCTGG GTCCTGCCTC GTCCTCCCTC CCAAGGAGCC ATTCTATGTG ACTCTGGGTG 2940 

GAAGTGCCCA GCCCCTGCCT GACGGNNNNN NNGATCACTC TCTGCTGGCA GGATTCTTCC 3000 

CGCTCCCCAC CTACCCAGCT GATGGGGGTT GGGGTGCTTC TTTCAGCCAA GGCTATGAAG 3060 

GGACAGCTGC TGGGACCCAC CTCCCCCCTT CCCCGGCCAC ATGCCGCGTC CCTGCCCCCA 3120 

CCCGGGTCTG GTGCTGAGGA TACAGCTCTT CTCAGTGTCT GAACAATCTC CAAAATTGAA 3180 

ATGTATATTT TTGCTAGGAG CCCCAGCTTC CTGTGTTTTT AATATAAATA GTGTACACAG 3240 

ACTGACGAAA CTTTAAATAA ATGGGAATTA AATATTTAAA AAAAAAAGCG GCCGCGAATT 3300 

C 3301 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
ACTCGGATCC AAGCCATGGC TGAGAACTTG CTGGACGG 38 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CACAAAGCTT AGGCCATGTT AGCACTGTTC GG 32 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTCAGTCGAC TTATTGAATT CCACTAGCTG GAGATCC 37 
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